Overview
Brought to you by YData
Dataset statistics
| Number of variables | 18 |
|---|---|
| Number of observations | 785916 |
| Missing cells | 1153445 |
| Missing cells (%) | 8.2% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 498.0 MiB |
| Average record size in memory | 664.5 B |
Variable types
| Numeric | 8 |
|---|---|
| DateTime | 1 |
| Categorical | 2 |
| Boolean | 3 |
| URL | 2 |
| Text | 2 |
isEdNeed has constant value "True" | Constant |
edInput is highly overall correlated with editor and 1 other fields | High correlation |
editor is highly overall correlated with edInput and 1 other fields | High correlation |
engages is highly overall correlated with likes and 1 other fields | High correlation |
isApproved is highly overall correlated with edInput and 1 other fields | High correlation |
isRT is highly overall correlated with rtUsID | High correlation |
likes is highly overall correlated with engages and 1 other fields | High correlation |
retweets is highly overall correlated with engages and 1 other fields | High correlation |
rtUsID is highly overall correlated with isRT and 1 other fields | High correlation |
usFlwrs is highly overall correlated with rtUsID and 1 other fields | High correlation |
usID is highly overall correlated with usFlwrs | High correlation |
photoUrl has 508020 (64.6%) missing values | Missing |
videoUrl has 645425 (82.1%) missing values | Missing |
engages is highly skewed (γ1 = 62.99060734) | Skewed |
likes is highly skewed (γ1 = 60.77687391) | Skewed |
retweets is highly skewed (γ1 = 101.2684199) | Skewed |
retweets has 26349 (3.4%) zeros | Zeros |
Reproduction
| Analysis started | 2025-03-20 08:21:05.193994 |
|---|---|
| Analysis finished | 2025-03-20 08:21:45.120517 |
| Duration | 39.93 seconds |
| Software version | ydata-profiling vv4.14.0 |
| Download configuration | config.json |
Variables
tweetID
Real number (ℝ)
| Distinct | 744731 |
|---|---|
| Distinct (%) | 94.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.1152126 × 1018 |
| Minimum | 53545 |
|---|---|
| Maximum | 1.1541792 × 1018 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.0 MiB |
Quantile statistics
| Minimum | 53545 |
|---|---|
| 5-th percentile | 1.0762031 × 1018 |
| Q1 | 1.0957907 × 1018 |
| median | 1.1164652 × 1018 |
| Q3 | 1.1376757 × 1018 |
| 95-th percentile | 1.151181 × 1018 |
| Maximum | 1.1541792 × 1018 |
| Range | 1.1541792 × 1018 |
| Interquartile range (IQR) | 4.1885075 × 1016 |
Descriptive statistics
| Standard deviation | 2.9252921 × 1016 |
|---|---|
| Coefficient of variation (CV) | 0.026230802 |
| Kurtosis | 197.63384 |
| Mean | 1.1152126 × 1018 |
| Median Absolute Deviation (MAD) | 2.0905278 × 1016 |
| Skewness | -7.0640382 |
| Sum | 3.2824341 × 1018 |
| Variance | 8.557334 × 1032 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1.106727752 × 1018 | 11 | < 0.1% |
| 1.078532699 × 1018 | 10 | < 0.1% |
| 1.098670107 × 1018 | 10 | < 0.1% |
| 1.12012213 × 1018 | 9 | < 0.1% |
| 1.108177162 × 1018 | 9 | < 0.1% |
| 1.098774345 × 1018 | 9 | < 0.1% |
| 1.102767908 × 1018 | 9 | < 0.1% |
| 1.122546022 × 1018 | 8 | < 0.1% |
| 1.136842849 × 1018 | 8 | < 0.1% |
| 1.138994643 × 1018 | 8 | < 0.1% |
| Other values (744721) | 785825 |
| Value | Count | Frequency (%) |
| 53545 | 1 | |
| 843792873 | 1 | |
| 845965787 | 1 | |
| 858130806 | 1 | |
| 870600456 | 1 | |
| 874672381 | 1 | |
| 876070278 | 1 | |
| 885858541 | 1 | |
| 888799721 | 1 | |
| 891656593 | 1 |
| Value | Count | Frequency (%) |
| 1.154179233 × 1018 | 1 | |
| 1.154179136 × 1018 | 1 | |
| 1.154179115 × 1018 | 1 | |
| 1.154179111 × 1018 | 1 | |
| 1.154178704 × 1018 | 1 | |
| 1.154178474 × 1018 | 1 | |
| 1.154178263 × 1018 | 1 | |
| 1.154178206 × 1018 | 1 | |
| 1.15417797 × 1018 | 1 | |
| 1.154177719 × 1018 | 1 |
crDate
Date
| Distinct | 686419 |
|---|---|
| Distinct (%) | 87.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.0 MiB |
| Minimum | 2006-11-01 03:33:20 |
|---|---|
| Maximum | 2019-07-24 23:59:07 |
| Invalid dates | 0 |
| Invalid dates (%) | 0.0% |
edInput
Categorical
High correlation 
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 37.9 MiB |
| -1 | |
|---|---|
| 1 | |
| 2 | |
| 4 | 32733 |
| 3 | 8200 |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 1.5377992 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | -1 |
|---|---|
| 2nd row | -1 |
| 3rd row | -1 |
| 4th row | -1 |
| 5th row | -1 |
Common Values
| Value | Count | Frequency (%) |
| -1 | 422665 | |
| 1 | 215577 | |
| 2 | 106741 | 13.6% |
| 4 | 32733 | 4.2% |
| 3 | 8200 | 1.0% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1 | 638242 | |
| 2 | 106741 | 13.6% |
| 4 | 32733 | 4.2% |
| 3 | 8200 | 1.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 638242 | |
| - | 422665 | |
| 2 | 106741 | 8.8% |
| 4 | 32733 | 2.7% |
| 3 | 8200 | 0.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 785916 | |
| Dash Punctuation | 422665 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 638242 | |
| 2 | 106741 | 13.6% |
| 4 | 32733 | 4.2% |
| 3 | 8200 | 1.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 422665 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1208581 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 638242 | |
| - | 422665 | |
| 2 | 106741 | 8.8% |
| 4 | 32733 | 2.7% |
| 3 | 8200 | 0.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1208581 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 638242 | |
| - | 422665 | |
| 2 | 106741 | 8.8% |
| 4 | 32733 | 2.7% |
| 3 | 8200 | 0.7% |
editor
Real number (ℝ)
High correlation 
| Distinct | 12 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2311.9631 |
| Minimum | -1 |
|---|---|
| Maximum | 5101 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 422665 |
| Negative (%) | 53.8% |
| Memory size | 6.0 MiB |
Quantile statistics
| Minimum | -1 |
|---|---|
| 5-th percentile | -1 |
| Q1 | -1 |
| median | -1 |
| Q3 | 5003 |
| 95-th percentile | 5007 |
| Maximum | 5101 |
| Range | 5102 |
| Interquartile range (IQR) | 5004 |
Descriptive statistics
| Standard deviation | 2495.1589 |
|---|---|
| Coefficient of variation (CV) | 1.0792382 |
| Kurtosis | -1.9768935 |
| Mean | 2311.9631 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 0.15187097 |
| Sum | 1.8170088 × 109 |
| Variance | 6225817.9 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| -1 | 422665 | |
| 5004 | 68536 | 8.7% |
| 5003 | 68186 | 8.7% |
| 5002 | 59317 | 7.5% |
| 5001 | 52629 | 6.7% |
| 5006 | 40658 | 5.2% |
| 5007 | 27722 | 3.5% |
| 5005 | 24934 | 3.2% |
| 5008 | 21167 | 2.7% |
| 5101 | 44 | < 0.1% |
| Other values (2) | 58 | < 0.1% |
| Value | Count | Frequency (%) |
| -1 | 422665 | |
| 1001 | 36 | < 0.1% |
| 2001 | 22 | < 0.1% |
| 5001 | 52629 | 6.7% |
| 5002 | 59317 | 7.5% |
| 5003 | 68186 | 8.7% |
| 5004 | 68536 | 8.7% |
| 5005 | 24934 | 3.2% |
| 5006 | 40658 | 5.2% |
| 5007 | 27722 | 3.5% |
| Value | Count | Frequency (%) |
| 5101 | 44 | < 0.1% |
| 5008 | 21167 | 2.7% |
| 5007 | 27722 | |
| 5006 | 40658 | |
| 5005 | 24934 | 3.2% |
| 5004 | 68536 | |
| 5003 | 68186 | |
| 5002 | 59317 | |
| 5001 | 52629 | |
| 2001 | 22 | < 0.1% |
engages
Real number (ℝ)
High correlation  Skewed 
| Distinct | 21684 |
|---|---|
| Distinct (%) | 2.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1403.6372 |
| Minimum | 0 |
|---|---|
| Maximum | 4152927 |
| Zeros | 3406 |
| Zeros (%) | 0.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.0 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 6 |
| Q1 | 23 |
| median | 64 |
| Q3 | 250 |
| 95-th percentile | 3070 |
| Maximum | 4152927 |
| Range | 4152927 |
| Interquartile range (IQR) | 227 |
Descriptive statistics
| Standard deviation | 16659.603 |
|---|---|
| Coefficient of variation (CV) | 11.868881 |
| Kurtosis | 8401.259 |
| Mean | 1403.6372 |
| Median Absolute Deviation (MAD) | 52 |
| Skewness | 62.990607 |
| Sum | 1.1031409 × 109 |
| Variance | 2.7754238 × 108 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 12 | 9949 | 1.3% |
| 10 | 9876 | 1.3% |
| 11 | 9863 | 1.3% |
| 8 | 9840 | 1.3% |
| 9 | 9764 | 1.2% |
| 13 | 9748 | 1.2% |
| 14 | 9715 | 1.2% |
| 15 | 9587 | 1.2% |
| 7 | 9501 | 1.2% |
| 6 | 9472 | 1.2% |
| Other values (21674) | 688601 |
| Value | Count | Frequency (%) |
| 0 | 3406 | 0.4% |
| 1 | 3080 | 0.4% |
| 2 | 3321 | 0.4% |
| 3 | 7719 | |
| 4 | 8695 | |
| 5 | 9123 | |
| 6 | 9472 | |
| 7 | 9501 | |
| 8 | 9840 | |
| 9 | 9764 |
| Value | Count | Frequency (%) |
| 4152927 | 1 | |
| 2447742 | 1 | |
| 2212097 | 1 | |
| 2033066 | 1 | |
| 2013254 | 1 | |
| 1979955 | 1 | |
| 1908389 | 1 | |
| 1685160 | 1 | |
| 1672859 | 1 | |
| 1558032 | 1 |
isApproved
Boolean
High correlation 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 767.6 KiB |
| False | |
|---|---|
| True |
| Value | Count | Frequency (%) |
| False | 558472 | |
| True | 227444 |
isEdNeed
Boolean
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 767.6 KiB |
| True |
|---|
| Value | Count | Frequency (%) |
| True | 785916 |
isRT
Boolean
High correlation 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 767.6 KiB |
| False | |
|---|---|
| True |
| Value | Count | Frequency (%) |
| False | 651001 | |
| True | 134915 | 17.2% |
likes
Real number (ℝ)
High correlation  Skewed 
| Distinct | 19179 |
|---|---|
| Distinct (%) | 2.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1085.909 |
| Minimum | 0 |
|---|---|
| Maximum | 3206434 |
| Zeros | 4353 |
| Zeros (%) | 0.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.0 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 4 |
| Q1 | 16 |
| median | 45 |
| Q3 | 184 |
| 95-th percentile | 2347 |
| Maximum | 3206434 |
| Range | 3206434 |
| Interquartile range (IQR) | 168 |
Descriptive statistics
| Standard deviation | 12939.926 |
|---|---|
| Coefficient of variation (CV) | 11.916215 |
| Kurtosis | 7933.3555 |
| Mean | 1085.909 |
| Median Absolute Deviation (MAD) | 37 |
| Skewness | 60.776874 |
| Sum | 8.5343326 × 108 |
| Variance | 1.6744167 × 108 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 7 | 14423 | 1.8% |
| 6 | 14374 | 1.8% |
| 8 | 14212 | 1.8% |
| 9 | 14188 | 1.8% |
| 5 | 13961 | 1.8% |
| 10 | 13818 | 1.8% |
| 4 | 13596 | 1.7% |
| 11 | 13515 | 1.7% |
| 3 | 13037 | 1.7% |
| 12 | 12948 | 1.6% |
| Other values (19169) | 647844 |
| Value | Count | Frequency (%) |
| 0 | 4353 | 0.6% |
| 1 | 6079 | |
| 2 | 9637 | |
| 3 | 13037 | |
| 4 | 13596 | |
| 5 | 13961 | |
| 6 | 14374 | |
| 7 | 14423 | |
| 8 | 14212 | |
| 9 | 14188 |
| Value | Count | Frequency (%) |
| 3206434 | 1 | |
| 1851624 | 1 | |
| 1590522 | 1 | |
| 1582338 | 1 | |
| 1555878 | 1 | |
| 1506312 | 1 | |
| 1302283 | 1 | |
| 1253834 | 1 | |
| 1117840 | 1 | |
| 1108652 | 1 |
photoUrl
URL
Missing 
| Distinct | 255085 |
|---|---|
| Distinct (%) | 91.8% |
| Missing | 508020 |
| Missing (%) | 64.6% |
| Memory size | 40.9 MiB |
| https://pbs.twimg.com/media/D6v2McUW0AAj24Y.jpg | 105 |
|---|---|
| https://pbs.twimg.com/media/D2HeMOXWoAAPZmG.jpg | 104 |
| https://pbs.twimg.com/media/D0cMdWQU0AAsd6h.jpg | 85 |
| https://pbs.twimg.com/media/CdNsguLW0AEUDX4.jpg | 72 |
| https://pbs.twimg.com/media/D0C5hQIWoAATvDm.jpg | 46 |
| Other values (255080) | |
| (Missing) |
| Value | Count | Frequency (%) |
| https://pbs.twimg.com/media/D6v2McUW0AAj24Y.jpg | 105 | < 0.1% |
| https://pbs.twimg.com/media/D2HeMOXWoAAPZmG.jpg | 104 | < 0.1% |
| https://pbs.twimg.com/media/D0cMdWQU0AAsd6h.jpg | 85 | < 0.1% |
| https://pbs.twimg.com/media/CdNsguLW0AEUDX4.jpg | 72 | < 0.1% |
| https://pbs.twimg.com/media/D0C5hQIWoAATvDm.jpg | 46 | < 0.1% |
| https://pbs.twimg.com/media/D5N9LEoW0AAsUyJ.jpg | 41 | < 0.1% |
| https://pbs.twimg.com/media/D5N7u9kWwAArkna.jpg | 39 | < 0.1% |
| https://pbs.twimg.com/media/D0bw9zmV4AAStDF.jpg | 38 | < 0.1% |
| https://pbs.twimg.com/media/Dy-hihVWsAEa9l8.jpg | 33 | < 0.1% |
| https://pbs.twimg.com/media/D1kQj_kW0AEfAIL.jpg | 32 | < 0.1% |
| Other values (255075) | 277301 | |
| (Missing) | 508020 |
| Value | Count | Frequency (%) |
| https | 277896 | |
| (Missing) | 508020 |
| Value | Count | Frequency (%) |
| pbs.twimg.com | 277896 | |
| (Missing) | 508020 |
| Value | Count | Frequency (%) |
| /media/D6v2McUW0AAj24Y.jpg | 105 | < 0.1% |
| /media/D2HeMOXWoAAPZmG.jpg | 104 | < 0.1% |
| /media/D0cMdWQU0AAsd6h.jpg | 85 | < 0.1% |
| /media/CdNsguLW0AEUDX4.jpg | 72 | < 0.1% |
| /media/D0C5hQIWoAATvDm.jpg | 46 | < 0.1% |
| /media/D5N9LEoW0AAsUyJ.jpg | 41 | < 0.1% |
| /media/D5N7u9kWwAArkna.jpg | 39 | < 0.1% |
| /media/D0bw9zmV4AAStDF.jpg | 38 | < 0.1% |
| /media/Dy-hihVWsAEa9l8.jpg | 33 | < 0.1% |
| /media/D1kQj_kW0AEfAIL.jpg | 32 | < 0.1% |
| Other values (255075) | 277301 | |
| (Missing) | 508020 |
| Value | Count | Frequency (%) |
| 277896 | ||
| (Missing) | 508020 |
| Value | Count | Frequency (%) |
| 277896 | ||
| (Missing) | 508020 |
retweets
Real number (ℝ)
High correlation  Skewed  Zeros 
| Distinct | 10589 |
|---|---|
| Distinct (%) | 1.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 317.7282 |
| Minimum | 0 |
|---|---|
| Maximum | 1335638 |
| Zeros | 26349 |
| Zeros (%) | 3.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.0 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 6 |
| median | 18 |
| Q3 | 65 |
| 95-th percentile | 692 |
| Maximum | 1335638 |
| Range | 1335638 |
| Interquartile range (IQR) | 59 |
Descriptive statistics
| Standard deviation | 4053.2674 |
|---|---|
| Coefficient of variation (CV) | 12.757028 |
| Kurtosis | 22401.35 |
| Mean | 317.7282 |
| Median Absolute Deviation (MAD) | 15 |
| Skewness | 101.26842 |
| Sum | 2.4970768 × 108 |
| Variance | 16428977 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2 | 32199 | 4.1% |
| 3 | 31872 | 4.1% |
| 4 | 30265 | 3.9% |
| 1 | 30203 | 3.8% |
| 5 | 28492 | 3.6% |
| 6 | 26489 | 3.4% |
| 0 | 26349 | 3.4% |
| 7 | 24729 | 3.1% |
| 8 | 22879 | 2.9% |
| 9 | 20728 | 2.6% |
| Other values (10579) | 511711 |
| Value | Count | Frequency (%) |
| 0 | 26349 | |
| 1 | 30203 | |
| 2 | 32199 | |
| 3 | 31872 | |
| 4 | 30265 | |
| 5 | 28492 | |
| 6 | 26489 | |
| 7 | 24729 | |
| 8 | 22879 | |
| 9 | 20728 |
| Value | Count | Frequency (%) |
| 1335638 | 1 | |
| 946493 | 1 | |
| 621575 | 1 | |
| 596118 | 1 | |
| 564207 | 1 | |
| 511486 | 1 | |
| 473643 | 1 | |
| 457376 | 1 | |
| 450728 | 1 | |
| 382877 | 1 |
rtUsID
Real number (ℝ)
High correlation 
| Distinct | 639 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.3650976 × 1016 |
| Minimum | -1 |
|---|---|
| Maximum | 1.108957 × 1018 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 651001 |
| Negative (%) | 82.8% |
| Memory size | 6.0 MiB |
Quantile statistics
| Minimum | -1 |
|---|---|
| 5-th percentile | -1 |
| Q1 | -1 |
| median | -1 |
| Q3 | -1 |
| 95-th percentile | 7.007845 × 1017 |
| Maximum | 1.108957 × 1018 |
| Range | 1.108957 × 1018 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 1.8943837 × 1017 |
|---|---|
| Coefficient of variation (CV) | 4.3398426 |
| Kurtosis | 15.379328 |
| Mean | 4.3650976 × 1016 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 4.1480429 |
| Sum | -4.9433192 × 1018 |
| Variance | 3.5886895 × 1034 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| -1 | 651001 | |
| 20562637 | 7381 | 0.9% |
| 34713362 | 5476 | 0.7% |
| 807095 | 4022 | 0.5% |
| 7.814273015 × 1017 | 4019 | 0.5% |
| 1235663514 | 3866 | 0.5% |
| 25453312 | 3785 | 0.5% |
| 8.585161114 × 1017 | 3351 | 0.4% |
| 2866476539 | 2747 | 0.3% |
| 8.657665192 × 1017 | 2709 | 0.3% |
| Other values (629) | 97559 | 12.4% |
| Value | Count | Frequency (%) |
| -1 | 651001 | |
| 428333 | 436 | 0.1% |
| 621523 | 335 | < 0.1% |
| 621583 | 10 | < 0.1% |
| 624413 | 25 | < 0.1% |
| 742143 | 559 | 0.1% |
| 759251 | 188 | < 0.1% |
| 807095 | 4022 | 0.5% |
| 809760 | 9 | < 0.1% |
| 816653 | 83 | < 0.1% |
| Value | Count | Frequency (%) |
| 1.108957041 × 1018 | 4 | < 0.1% |
| 1.101825687 × 1018 | 62 | < 0.1% |
| 1.066832136 × 1018 | 13 | < 0.1% |
| 1.062791226 × 1018 | 10 | < 0.1% |
| 1.061553475 × 1018 | 1 | < 0.1% |
| 1.05844362 × 1018 | 59 | < 0.1% |
| 1.04833502 × 1018 | 27 | < 0.1% |
| 1.047630407 × 1018 | 92 | |
| 1.045103187 × 1018 | 166 | |
| 1.041537348 × 1018 | 9 | < 0.1% |
text
Text
| Distinct | 710522 |
|---|---|
| Distinct (%) | 90.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 216.7 MiB |
Length
| Max length | 968 |
|---|---|
| Median length | 352 |
| Mean length | 130.86807 |
| Min length | 4 |
Unique
| Unique | 666155 ? |
|---|---|
| Unique (%) | 84.8% |
Sample
| 1st row | The immediate impulse for an alliance of the EU's northern states is Brexit https://t.co/nlhUD36hay https://t.co/shwMWpjjuK |
|---|---|
| 2nd row | America's economy is flashing some warning signs, but -- for now -- the labor market appears to be going strong https://t.co/xvCPgtqMzy https://t.co/0sQdzAsME3 |
| 3rd row | Lyft files for what is expected to be one of the hottest IPOs in 2019 https://t.co/qEjyniazlD |
| 4th row | Exporters still waiting to get Rs 6,000 crore worth of input tax credit refunds Many being denied tax refunds by state governments, such as Andhra Pradesh, Uttar Pradesh, Bihar and Chhattisgarh, who say they are cash starved @Subhayan_ism @GST_Council https://t.co/QRBg8b98Rr |
| 5th row | Ride-hailing firm Lyft races to leave Uber behind in IPO chase https://t.co/0qCsdx2LYS https://t.co/gHZLUntYkL |
| Value | Count | Frequency (%) |
| the | 524924 | 3.7% |
| to | 339238 | 2.4% |
| a | 288969 | 2.0% |
| of | 255689 | 1.8% |
| in | 220648 | 1.6% |
| and | 211586 | 1.5% |
| is | 160255 | 1.1% |
| for | 139028 | 1.0% |
| you | 127547 | 0.9% |
| 124246 | 0.9% | |
| Other values (1076361) | 11836996 |
Most occurring characters
| Value | Count | Frequency (%) |
| 13304640 | 12.9% | |
| t | 7991076 | 7.8% |
| e | 7374871 | 7.2% |
| o | 5977026 | 5.8% |
| a | 5137798 | 5.0% |
| s | 5093770 | 5.0% |
| i | 4680619 | 4.6% |
| n | 4483766 | 4.4% |
| r | 4188907 | 4.1% |
| h | 3602956 | 3.5% |
| Other values (2551) | 41015882 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 71198390 | |
| Space Separator | 13317519 | 12.9% |
| Uppercase Letter | 8118612 | 7.9% |
| Other Punctuation | 6807706 | 6.6% |
| Decimal Number | 2123334 | 2.1% |
| Control | 435896 | 0.4% |
| Dash Punctuation | 223873 | 0.2% |
| Other Symbol | 211993 | 0.2% |
| Final Punctuation | 143098 | 0.1% |
| Currency Symbol | 52005 | 0.1% |
| Other values (13) | 218885 | 0.2% |
Most frequent character per category
Other Symbol
| Value | Count | Frequency (%) |
| ⠀ | 25445 | 12.0% |
| 😍 | 11423 | 5.4% |
| 👉 | 8437 | 4.0% |
| 😂 | 6218 | 2.9% |
| 🔥 | 4996 | 2.4% |
| ✅ | 4082 | 1.9% |
| ❤ | 3836 | 1.8% |
| ➡ | 3727 | 1.8% |
| 📷 | 3385 | 1.6% |
| ✨ | 3056 | 1.4% |
| Other values (1213) | 137388 |
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 7991076 | |
| e | 7374871 | 10.4% |
| o | 5977026 | 8.4% |
| a | 5137798 | 7.2% |
| s | 5093770 | 7.2% |
| i | 4680619 | 6.6% |
| n | 4483766 | 6.3% |
| r | 4188907 | 5.9% |
| h | 3602956 | 5.1% |
| c | 2813206 | 4.0% |
| Other values (405) | 19854395 |
Other Letter
| Value | Count | Frequency (%) |
| 港 | 48 | 3.3% |
| 香 | 48 | 3.3% |
| 送 | 45 | 3.1% |
| 反 | 45 | 3.1% |
| 中 | 45 | 3.1% |
| リ | 37 | 2.5% |
| ツ | 32 | 2.2% |
| ス | 22 | 1.5% |
| º | 21 | 1.4% |
| ト | 21 | 1.4% |
| Other values (392) | 1092 |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 576653 | 7.1% |
| S | 499116 | 6.1% |
| A | 482271 | 5.9% |
| I | 432482 | 5.3% |
| C | 427195 | 5.3% |
| M | 385985 | 4.8% |
| B | 375685 | 4.6% |
| P | 334097 | 4.1% |
| W | 332611 | 4.1% |
| D | 330659 | 4.1% |
| Other values (197) | 3941858 |
Modifier Letter
| Value | Count | Frequency (%) |
| ᵉ | 187 | 10.8% |
| ᵒ | 160 | 9.2% |
| ᵗ | 96 | 5.5% |
| ᵃ | 88 | 5.1% |
| ʰ | 85 | 4.9% |
| ˢ | 84 | 4.8% |
| ⁿ | 80 | 4.6% |
| ᵐ | 78 | 4.5% |
| ⁱ | 69 | 4.0% |
| ᵘ | 64 | 3.7% |
| Other values (52) | 742 |
Math Symbol
| Value | Count | Frequency (%) |
| > | 22140 | |
| | | 13056 | |
| + | 6565 | 14.4% |
| ~ | 1830 | 4.0% |
| = | 1252 | 2.7% |
| < | 300 | 0.7% |
| → | 171 | 0.4% |
| ⤵ | 87 | 0.2% |
| × | 72 | 0.2% |
| ↓ | 11 | < 0.1% |
| Other values (33) | 115 | 0.3% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 2727718 | |
| . | 1515882 | |
| : | 1040397 | 15.3% |
| # | 407266 | 6.0% |
| , | 383111 | 5.6% |
| ' | 231499 | 3.4% |
| @ | 226945 | 3.3% |
| " | 96126 | 1.4% |
| ! | 73119 | 1.1% |
| ? | 51062 | 0.8% |
| Other values (31) | 54581 | 0.8% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 309010 | |
| 1 | 274281 | |
| 2 | 242656 | |
| 5 | 199786 | |
| 9 | 198002 | |
| 3 | 190132 | |
| 4 | 180651 | |
| 8 | 178767 | |
| 7 | 175510 | |
| 6 | 174493 | |
| Other values (20) | 46 | < 0.1% |
Nonspacing Mark
| Value | Count | Frequency (%) |
| ️ | 19187 | |
| ︎ | 117 | 0.6% |
| ͜ | 14 | 0.1% |
| ̵ | 13 | 0.1% |
| ̶ | 13 | 0.1% |
| ͡ | 11 | 0.1% |
| ิ | 3 | < 0.1% |
| ͞ | 3 | < 0.1% |
| ོ | 2 | < 0.1% |
| ี | 2 | < 0.1% |
| Other values (10) | 11 | 0.1% |
Format
| Value | Count | Frequency (%) |
| | 3186 | |
| | 1287 | |
| | 1275 | |
| | 455 | 6.4% |
| | 203 | 2.8% |
| | 146 | 2.0% |
| | 120 | 1.7% |
| | 91 | 1.3% |
| | 91 | 1.3% |
| | 63 | 0.9% |
| Other values (10) | 218 | 3.1% |
Modifier Symbol
| Value | Count | Frequency (%) |
| 🏻 | 2175 | |
| 🏼 | 1316 | |
| 🏽 | 382 | 7.9% |
| ^ | 248 | 5.1% |
|  ̄ | 229 | 4.7% |
| 🏾 | 202 | 4.2% |
| ` | 142 | 2.9% |
| ¯ | 56 | 1.2% |
| ´ | 38 | 0.8% |
| 🏿 | 35 | 0.7% |
| Other values (6) | 21 | 0.4% |
Other Number
| Value | Count | Frequency (%) |
| ² | 23 | |
| ½ | 23 | |
| ⁰ | 12 | |
| ¼ | 6 | 7.3% |
| ³ | 4 | 4.9% |
| ⅓ | 3 | 3.7% |
| ③ | 2 | 2.4% |
| ② | 2 | 2.4% |
| ① | 2 | 2.4% |
| ₂ | 2 | 2.4% |
| Other values (3) | 3 | 3.7% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 25891 | |
| [ | 6074 | 18.5% |
| { | 882 | 2.7% |
| „ | 17 | 0.1% |
| ‚ | 7 | < 0.1% |
| ( | 7 | < 0.1% |
| 【 | 3 | < 0.1% |
| 《 | 2 | < 0.1% |
| ︿ | 2 | < 0.1% |
| ༼ | 2 | < 0.1% |
| Other values (2) | 2 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 13304640 | ||
| 12370 | 0.1% | |
| 339 | < 0.1% | |
| 107 | < 0.1% | |
| 27 | < 0.1% | |
| 14 | < 0.1% | |
| 10 | < 0.1% | |
| 7 | < 0.1% | |
| 5 | < 0.1% |
Currency Symbol
| Value | Count | Frequency (%) |
| $ | 50659 | |
| £ | 1195 | 2.3% |
| € | 118 | 0.2% |
| ₿ | 17 | < 0.1% |
| ¥ | 7 | < 0.1% |
| ¢ | 5 | < 0.1% |
| ฿ | 2 | < 0.1% |
| ₹ | 1 | < 0.1% |
| ₣ | 1 | < 0.1% |
Control
| Value | Count | Frequency (%) |
| 435833 | ||
| | 22 | < 0.1% |
| 20 | < 0.1% | |
| | 8 | < 0.1% |
| | 4 | < 0.1% |
| | 4 | < 0.1% |
| | 4 | < 0.1% |
| | 1 | < 0.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 194513 | |
| — | 22073 | 9.9% |
| – | 6136 | 2.7% |
| ― | 1130 | 0.5% |
| 〰 | 16 | < 0.1% |
| ‑ | 2 | < 0.1% |
| ‒ | 2 | < 0.1% |
| ‐ | 1 | < 0.1% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 28291 | |
| ] | 6042 | 17.1% |
| } | 903 | 2.6% |
| ︶ | 10 | < 0.1% |
| ) | 7 | < 0.1% |
| 】 | 4 | < 0.1% |
| 》 | 2 | < 0.1% |
| ༽ | 2 | < 0.1% |
Final Punctuation
| Value | Count | Frequency (%) |
| ’ | 117374 | |
| ” | 25609 | 17.9% |
| » | 112 | 0.1% |
| › | 3 | < 0.1% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 32598 | |
| _ | 224 | 0.7% |
| ‿ | 5 | < 0.1% |
Initial Punctuation
| Value | Count | Frequency (%) |
| “ | 26497 | |
| ‘ | 10869 | |
| « | 108 | 0.3% |
Private Use
| Value | Count | Frequency (%) |
| | 4 | |
| | 1 | 16.7% |
| | 1 | 16.7% |
Enclosing Mark
| Value | Count | Frequency (%) |
| ⃣ | 199 | |
| ҉ | 4 | 2.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 79313384 | |
| Common | 23488126 | 22.8% |
| Braille | 25445 | < 0.1% |
| Inherited | 22747 | < 0.1% |
| Han | 669 | < 0.1% |
| Katakana | 331 | < 0.1% |
| Hiragana | 126 | < 0.1% |
| Hangul | 110 | < 0.1% |
| Cyrillic | 86 | < 0.1% |
| Arabic | 86 | < 0.1% |
| Other values (13) | 201 | < 0.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 13304640 | ||
| / | 2727718 | 11.6% |
| . | 1515882 | 6.5% |
| : | 1040397 | 4.4% |
| 435833 | 1.9% | |
| # | 407266 | 1.7% |
| , | 383111 | 1.6% |
| 0 | 309010 | 1.3% |
| 1 | 274281 | 1.2% |
| 2 | 242656 | 1.0% |
| Other values (1828) | 2847332 | 12.1% |
Latin
| Value | Count | Frequency (%) |
| t | 7991076 | 10.1% |
| e | 7374871 | 9.3% |
| o | 5977026 | 7.5% |
| a | 5137798 | 6.5% |
| s | 5093770 | 6.4% |
| i | 4680619 | 5.9% |
| n | 4483766 | 5.7% |
| r | 4188907 | 5.3% |
| h | 3602956 | 4.5% |
| c | 2813206 | 3.5% |
| Other values (241) | 27969389 |
Han
| Value | Count | Frequency (%) |
| 港 | 48 | 7.2% |
| 香 | 48 | 7.2% |
| 送 | 45 | 6.7% |
| 反 | 45 | 6.7% |
| 中 | 45 | 6.7% |
| 花 | 20 | 3.0% |
| 挺 | 15 | 2.2% |
| 孤 | 15 | 2.2% |
| 物 | 13 | 1.9% |
| 大 | 12 | 1.8% |
| Other values (185) | 363 |
Hangul
| Value | Count | Frequency (%) |
| ㅤ | 14 | 12.7% |
| ㅅ | 12 | 10.9% |
| 북 | 6 | 5.5% |
| 담 | 6 | 5.5% |
| 회 | 6 | 5.5% |
| 미 | 5 | 4.5% |
| 수 | 3 | 2.7% |
| 남 | 3 | 2.7% |
| 한 | 2 | 1.8% |
| 혼 | 2 | 1.8% |
| Other values (39) | 51 |
Katakana
| Value | Count | Frequency (%) |
| リ | 37 | 11.2% |
| ツ | 32 | 9.7% |
| ス | 22 | 6.6% |
| ト | 21 | 6.3% |
| ア | 18 | 5.4% |
| マ | 17 | 5.1% |
| ン | 16 | 4.8% |
| ド | 16 | 4.8% |
| メ | 13 | 3.9% |
| イ | 12 | 3.6% |
| Other values (36) | 127 |
Cyrillic
| Value | Count | Frequency (%) |
| ү | 28 | |
| С | 7 | 8.1% |
| с | 6 | 7.0% |
| ғ | 4 | 4.7% |
| ҉ | 4 | 4.7% |
| и | 4 | 4.7% |
| о | 4 | 4.7% |
| а | 3 | 3.5% |
| Х | 2 | 2.3% |
| в | 2 | 2.3% |
| Other values (20) | 22 |
Thai
| Value | Count | Frequency (%) |
| ร | 9 | |
| ง | 8 | 12.3% |
| า | 6 | 9.2% |
| ช | 4 | 6.2% |
| ิ | 3 | 4.6% |
| เ | 3 | 4.6% |
| ก | 3 | 4.6% |
| ท | 3 | 4.6% |
| พ | 3 | 4.6% |
| ะ | 2 | 3.1% |
| Other values (17) | 21 |
Hiragana
| Value | Count | Frequency (%) |
| り | 17 | |
| づ | 15 | |
| の | 14 | |
| を | 10 | 7.9% |
| が | 9 | 7.1% |
| し | 8 | 6.3% |
| い | 6 | 4.8% |
| と | 5 | 4.0% |
| る | 4 | 3.2% |
| き | 4 | 3.2% |
| Other values (16) | 34 |
Arabic
| Value | Count | Frequency (%) |
| و | 11 | |
| ل | 11 | |
| ا | 11 | |
| ي | 9 | |
| ن | 7 | 8.1% |
| ر | 6 | 7.0% |
| م | 4 | 4.7% |
| ئ | 3 | 3.5% |
| س | 3 | 3.5% |
| ج | 3 | 3.5% |
| Other values (15) | 18 |
Canadian_Aboriginal
| Value | Count | Frequency (%) |
| ᑦ | 4 | 10.3% |
| ᐅ | 4 | 10.3% |
| ᕕ | 3 | 7.7% |
| ᐊ | 3 | 7.7% |
| ᕗ | 3 | 7.7% |
| ᐛ | 2 | 5.1% |
| ᖅ | 2 | 5.1% |
| ᑭ | 2 | 5.1% |
| ᑐ | 2 | 5.1% |
| ᑕ | 2 | 5.1% |
| Other values (12) | 12 |
Hebrew
| Value | Count | Frequency (%) |
| א | 2 | |
| ש | 1 | |
| ִ | 1 | |
| ׁ | 1 | |
| ֖ | 1 | |
| י | 1 | |
| ת | 1 | |
| ֵ | 1 | |
| ב | 1 | |
| ְ | 1 | |
| Other values (2) | 2 |
Inherited
| Value | Count | Frequency (%) |
| ️ | 19187 | |
| | 3186 | 14.0% |
| ⃣ | 199 | 0.9% |
| ︎ | 117 | 0.5% |
| ͜ | 14 | 0.1% |
| ̵ | 13 | 0.1% |
| ̶ | 13 | 0.1% |
| ͡ | 11 | < 0.1% |
| | 4 | < 0.1% |
| ͞ | 3 | < 0.1% |
Greek
| Value | Count | Frequency (%) |
| π | 7 | |
| ω | 3 | |
| α | 3 | |
| θ | 1 | 5.3% |
| ᵧ | 1 | 5.3% |
| γ | 1 | 5.3% |
| β | 1 | 5.3% |
| Λ | 1 | 5.3% |
| μ | 1 | 5.3% |
Devanagari
| Value | Count | Frequency (%) |
| ट | 1 | |
| आ | 1 | |
| र | 1 | |
| ् | 1 | |
| य | 1 | |
| भ | 1 |
Unknown
| Value | Count | Frequency (%) |
| | 4 | |
| | 1 | 16.7% |
| | 1 | 16.7% |
Tibetan
| Value | Count | Frequency (%) |
| ོ | 2 | |
| ༼ | 2 | |
| ༽ | 2 |
Egyptian_Hieroglyphs
| Value | Count | Frequency (%) |
| 𓅃 | 1 | |
| 𓂀 | 1 | |
| 𓁁 | 1 |
Braille
| Value | Count | Frequency (%) |
| ⠀ | 25445 |
Armenian
| Value | Count | Frequency (%) |
| ֎ | 24 |
Oriya
| Value | Count | Frequency (%) |
| ୧ | 7 |
Georgian
| Value | Count | Frequency (%) |
| ღ | 7 |
Lao
| Value | Count | Frequency (%) |
| ຈ | 4 |
Kannada
| Value | Count | Frequency (%) |
| ಠ | 2 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 102363049 | |
| Punctuation | 222247 | 0.2% |
| None | 130473 | 0.1% |
| Emoticons | 40413 | < 0.1% |
| Braille | 25445 | < 0.1% |
| VS | 19304 | < 0.1% |
| Dingbats | 17815 | < 0.1% |
| Enclosed Alphanum Sup | 10271 | < 0.1% |
| Misc Symbols | 7807 | < 0.1% |
| Math Alphanum | 5128 | < 0.1% |
| Other values (46) | 9359 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 13304640 | 13.0% | |
| t | 7991076 | 7.8% |
| e | 7374871 | 7.2% |
| o | 5977026 | 5.8% |
| a | 5137798 | 5.0% |
| s | 5093770 | 5.0% |
| i | 4680619 | 4.6% |
| n | 4483766 | 4.4% |
| r | 4188907 | 4.1% |
| h | 3602956 | 3.5% |
| Other values (87) | 40527620 |
Punctuation
| Value | Count | Frequency (%) |
| ’ | 117374 | |
| “ | 26497 | 11.9% |
| ” | 25609 | 11.5% |
| — | 22073 | 9.9% |
| ‘ | 10869 | 4.9% |
| – | 6136 | 2.8% |
| | 3186 | 1.4% |
| … | 3091 | 1.4% |
| • | 2488 | 1.1% |
| | 1287 | 0.6% |
| Other values (24) | 3637 | 1.6% |
Braille
| Value | Count | Frequency (%) |
| ⠀ | 25445 |
VS
| Value | Count | Frequency (%) |
| ️ | 19187 | |
| ︎ | 117 | 0.6% |
None
| Value | Count | Frequency (%) |
| 12370 | 9.5% | |
| 👉 | 8437 | 6.5% |
| 🔥 | 4996 | 3.8% |
| 📷 | 3385 | 2.6% |
| 🤣 | 2659 | 2.0% |
| 📸 | 2238 | 1.7% |
| 🏻 | 2175 | 1.7% |
| ⬇ | 2159 | 1.7% |
| 👇 | 2075 | 1.6% |
| é | 1925 | 1.5% |
| Other values (1058) | 88054 |
Emoticons
| Value | Count | Frequency (%) |
| 😍 | 11423 | |
| 😂 | 6218 | |
| 😊 | 2230 | 5.5% |
| 😫 | 1539 | 3.8% |
| 😭 | 1484 | 3.7% |
| 😱 | 1447 | 3.6% |
| 😘 | 1335 | 3.3% |
| 😲 | 1132 | 2.8% |
| 🙌 | 1130 | 2.8% |
| 🙍 | 918 | 2.3% |
| Other values (70) | 11557 |
Dingbats
| Value | Count | Frequency (%) |
| ✅ | 4082 | |
| ❤ | 3836 | |
| ➡ | 3727 | |
| ✨ | 3056 | |
| ❄ | 995 | 5.6% |
| ❌ | 433 | 2.4% |
| ✈ | 340 | 1.9% |
| ✔ | 324 | 1.8% |
| ✌ | 207 | 1.2% |
| ✊ | 116 | 0.7% |
| Other values (39) | 699 | 3.9% |
Misc Symbols
| Value | Count | Frequency (%) |
| ♂ | 1505 | |
| ♀ | 901 | |
| ⚡ | 820 | 10.5% |
| ☀ | 662 | 8.5% |
| ♥ | 390 | 5.0% |
| ☁ | 376 | 4.8% |
| ☕ | 362 | 4.6% |
| ☺ | 304 | 3.9% |
| ★ | 184 | 2.4% |
| ⚽ | 126 | 1.6% |
| Other values (81) | 2177 |
Enclosed Alphanum Sup
| Value | Count | Frequency (%) |
| 🇸 | 1179 | 11.5% |
| 🇺 | 1125 | 11.0% |
| 🇮 | 773 | 7.5% |
| 🇷 | 767 | 7.5% |
| 🇹 | 564 | 5.5% |
| 🇳 | 560 | 5.5% |
| 🇦 | 537 | 5.2% |
| 🇨 | 489 | 4.8% |
| 🇬 | 463 | 4.5% |
| 🇪 | 442 | 4.3% |
| Other values (22) | 3372 |
Specials
| Value | Count | Frequency (%) |
|  | 785 | |
| � | 10 | 1.3% |
Geometric Shapes
| Value | Count | Frequency (%) |
| ▫ | 519 | |
| ▶ | 342 | |
| ▪ | 193 | 15.3% |
| ● | 61 | 4.8% |
| ■ | 25 | 2.0% |
| ▬ | 18 | 1.4% |
| ► | 18 | 1.4% |
| ◡ | 15 | 1.2% |
| ◤ | 9 | 0.7% |
| ◾ | 9 | 0.7% |
| Other values (17) | 56 | 4.4% |
Misc Technical
| Value | Count | Frequency (%) |
| ⏩ | 322 | |
| ⏰ | 191 | |
| ⏳ | 87 | 10.8% |
| ⏬ | 55 | 6.8% |
| ⌚ | 39 | 4.9% |
| ⏱ | 33 | 4.1% |
| ⌛ | 25 | 3.1% |
| ⌒ | 20 | 2.5% |
| ⏲ | 8 | 1.0% |
| ⌨ | 6 | 0.7% |
| Other values (10) | 17 | 2.1% |
Math Alphanum
| Value | Count | Frequency (%) |
| 𝑒 | 209 | 4.1% |
| 𝑖 | 146 | 2.8% |
| 𝑎 | 144 | 2.8% |
| 𝑛 | 129 | 2.5% |
| 𝑜 | 127 | 2.5% |
| 𝑡 | 125 | 2.4% |
| 𝑠 | 124 | 2.4% |
| 𝑟 | 117 | 2.3% |
| 𝚎 | 88 | 1.7% |
| 𝚘 | 78 | 1.5% |
| Other values (390) | 3841 |
Phonetic Ext
| Value | Count | Frequency (%) |
| ᵉ | 187 | |
| ᵒ | 160 | |
| ᵗ | 96 | 8.8% |
| ᵃ | 88 | 8.1% |
| ᵐ | 78 | 7.2% |
| ᵘ | 64 | 5.9% |
| ᵍ | 55 | 5.1% |
| ᵈ | 39 | 3.6% |
| ᵏ | 36 | 3.3% |
| ᵇ | 24 | 2.2% |
| Other values (34) | 262 |
Box Drawing
| Value | Count | Frequency (%) |
| ┃ | 187 | |
| ┻ | 134 | |
| ─ | 126 | |
| ┳ | 122 | |
| ━ | 121 | |
| ┛ | 117 | |
| ╲ | 80 | 5.8% |
| ╱ | 80 | 5.8% |
| ┓ | 78 | 5.6% |
| ┏ | 64 | 4.6% |
| Other values (15) | 272 |
Arrows
| Value | Count | Frequency (%) |
| → | 171 | |
| ⇠ | 24 | 9.0% |
| ⇢ | 22 | 8.3% |
| ↓ | 11 | 4.1% |
| ↔ | 10 | 3.8% |
| ↘ | 7 | 2.6% |
| ⇄ | 5 | 1.9% |
| ↗ | 3 | 1.1% |
| ↪ | 3 | 1.1% |
| ↠ | 2 | 0.8% |
| Other values (7) | 8 | 3.0% |
Block Elements
| Value | Count | Frequency (%) |
| ▔ | 140 | |
| █ | 87 | |
| ▀ | 31 | 9.0% |
| ▄ | 30 | 8.7% |
| ▏ | 20 | 5.8% |
| ▕ | 19 | 5.5% |
| ▂ | 6 | 1.7% |
| ▐ | 4 | 1.2% |
| ▌ | 3 | 0.9% |
| ▓ | 3 | 0.9% |
Tags
| Value | Count | Frequency (%) |
| | 120 | |
| | 91 | |
| | 91 | |
| | 63 | |
| | 48 | 8.7% |
| | 47 | 8.5% |
| | 29 | 5.3% |
| | 29 | 5.3% |
| | 15 | 2.7% |
| | 15 | 2.7% |
| Other values (2) | 2 | 0.4% |
Currency Symbols
| Value | Count | Frequency (%) |
| € | 118 | |
| ₿ | 17 | 12.4% |
| ₹ | 1 | 0.7% |
| ₣ | 1 | 0.7% |
Letterlike Symbols
| Value | Count | Frequency (%) |
| ™ | 95 | |
| ℮ | 75 | |
| ℎ | 73 | |
| ℕ | 12 | 4.5% |
| ℹ | 7 | 2.6% |
| ℝ | 2 | 0.7% |
| ℂ | 1 | 0.4% |
| ℯ | 1 | 0.4% |
| ℉ | 1 | 0.4% |
| ℃ | 1 | 0.4% |
Modifier Letters
| Value | Count | Frequency (%) |
| ʰ | 85 | |
| ˢ | 84 | |
| ʳ | 61 | |
| ˡ | 58 | |
| ʷ | 55 | |
| ʸ | 45 | |
| ʻ | 10 | 2.3% |
| ˚ | 8 | 1.9% |
| ʺ | 7 | 1.6% |
| ʲ | 5 | 1.2% |
| Other values (7) | 13 | 3.0% |
CJK
| Value | Count | Frequency (%) |
| 港 | 48 | 7.2% |
| 香 | 48 | 7.2% |
| 送 | 45 | 6.7% |
| 反 | 45 | 6.7% |
| 中 | 45 | 6.7% |
| 花 | 20 | 3.0% |
| 挺 | 15 | 2.2% |
| 孤 | 15 | 2.2% |
| 物 | 13 | 1.9% |
| 大 | 12 | 1.8% |
| Other values (185) | 363 |
Katakana
| Value | Count | Frequency (%) |
| ー | 46 | 11.5% |
| リ | 37 | 9.3% |
| ツ | 32 | 8.0% |
| ・ | 26 | 6.5% |
| ス | 22 | 5.5% |
| ト | 21 | 5.3% |
| ア | 18 | 4.5% |
| マ | 17 | 4.3% |
| ン | 16 | 4.0% |
| ド | 16 | 4.0% |
| Other values (36) | 148 |
Phonetic Ext Sup
| Value | Count | Frequency (%) |
| ᶠ | 39 | |
| ᶜ | 38 | |
| ᶦ | 24 | |
| ᶻ | 3 | 2.9% |
Cyrillic
| Value | Count | Frequency (%) |
| ү | 28 | |
| С | 7 | 8.1% |
| с | 6 | 7.0% |
| ғ | 4 | 4.7% |
| ҉ | 4 | 4.7% |
| и | 4 | 4.7% |
| о | 4 | 4.7% |
| а | 3 | 3.5% |
| Х | 2 | 2.3% |
| в | 2 | 2.3% |
| Other values (20) | 22 |
Armenian
| Value | Count | Frequency (%) |
| ֎ | 24 |
IPA Ext
| Value | Count | Frequency (%) |
| ʀ | 20 | |
| ʜ | 20 | |
| ʖ | 11 | |
| ɪ | 8 | 7.8% |
| ʇ | 7 | 6.8% |
| ɹ | 5 | 4.9% |
| ə | 5 | 4.9% |
| ɥ | 4 | 3.9% |
| ɐ | 4 | 3.9% |
| ɴ | 3 | 2.9% |
| Other values (11) | 16 |
Hiragana
| Value | Count | Frequency (%) |
| り | 17 | |
| づ | 15 | 11.2% |
| の | 14 | 10.4% |
| を | 10 | 7.5% |
| が | 9 | 6.7% |
| し | 8 | 6.0% |
| ゜ | 8 | 6.0% |
| い | 6 | 4.5% |
| と | 5 | 3.7% |
| る | 4 | 3.0% |
| Other values (17) | 38 |
Playing Cards
| Value | Count | Frequency (%) |
| 🃏 | 17 |
Compat Jamo
| Value | Count | Frequency (%) |
| ㅤ | 14 | |
| ㅅ | 12 |
Diacriticals
| Value | Count | Frequency (%) |
| ͜ | 14 | |
| ̵ | 13 | |
| ̶ | 13 | |
| ͡ | 11 | |
| ͞ | 3 | 5.6% |
Arabic
| Value | Count | Frequency (%) |
| و | 11 | |
| ل | 11 | |
| ا | 11 | |
| ي | 9 | |
| ن | 7 | 8.0% |
| ر | 6 | 6.9% |
| م | 4 | 4.6% |
| ئ | 3 | 3.4% |
| س | 3 | 3.4% |
| ج | 3 | 3.4% |
| Other values (16) | 19 |
CJK Compat Forms
| Value | Count | Frequency (%) |
| ︶ | 10 | |
| ︿ | 2 | 15.4% |
| ︻ | 1 | 7.7% |
Thai
| Value | Count | Frequency (%) |
| ร | 9 | 13.4% |
| ง | 8 | 11.9% |
| า | 6 | 9.0% |
| ช | 4 | 6.0% |
| ิ | 3 | 4.5% |
| เ | 3 | 4.5% |
| ก | 3 | 4.5% |
| ท | 3 | 4.5% |
| พ | 3 | 4.5% |
| ะ | 2 | 3.0% |
| Other values (18) | 23 |
Alphabetic PF
| Value | Count | Frequency (%) |
| fi | 8 | |
| fl | 1 | 11.1% |
Oriya
| Value | Count | Frequency (%) |
| ୧ | 7 |
Georgian
| Value | Count | Frequency (%) |
| ღ | 7 |
Hangul
| Value | Count | Frequency (%) |
| 북 | 6 | 7.1% |
| 담 | 6 | 7.1% |
| 회 | 6 | 7.1% |
| 미 | 5 | 6.0% |
| 수 | 3 | 3.6% |
| 남 | 3 | 3.6% |
| 한 | 2 | 2.4% |
| 혼 | 2 | 2.4% |
| 정 | 2 | 2.4% |
| 이 | 2 | 2.4% |
| Other values (37) | 47 |
Math Operators
| Value | Count | Frequency (%) |
| ∽ | 6 | |
| ∙ | 6 | |
| ≠ | 4 | |
| ∧ | 4 | |
| ∩ | 4 | |
| ⊃ | 4 | |
| ∞ | 4 | |
| − | 2 | 5.0% |
| ≈ | 2 | 5.0% |
| ⊂ | 1 | 2.5% |
| Other values (3) | 3 |
Lao
| Value | Count | Frequency (%) |
| ຈ | 4 |
UCAS
| Value | Count | Frequency (%) |
| ᑦ | 4 | 10.3% |
| ᐅ | 4 | 10.3% |
| ᕕ | 3 | 7.7% |
| ᐊ | 3 | 7.7% |
| ᕗ | 3 | 7.7% |
| ᐛ | 2 | 5.1% |
| ᖅ | 2 | 5.1% |
| ᑭ | 2 | 5.1% |
| ᑐ | 2 | 5.1% |
| ᑕ | 2 | 5.1% |
| Other values (12) | 12 |
Latin Ext Additional
| Value | Count | Frequency (%) |
| ấ | 3 | |
| ễ | 3 | |
| ả | 1 | 12.5% |
| ố | 1 | 12.5% |
Number Forms
| Value | Count | Frequency (%) |
| ⅓ | 3 |
Kannada
| Value | Count | Frequency (%) |
| ಠ | 2 |
Enclosed Alphanum
| Value | Count | Frequency (%) |
| ③ | 2 | |
| ② | 2 | |
| ① | 2 | |
| Ⓝ | 1 | |
| Ⓥ | 1 |
Tibetan
| Value | Count | Frequency (%) |
| ོ | 2 | |
| ༼ | 2 | |
| ༽ | 2 |
Hebrew
| Value | Count | Frequency (%) |
| א | 2 | |
| ש | 1 | |
| ִ | 1 | |
| ׁ | 1 | |
| ֖ | 1 | |
| י | 1 | |
| ת | 1 | |
| ֵ | 1 | |
| ב | 1 | |
| ְ | 1 | |
| Other values (2) | 2 |
Small Forms
| Value | Count | Frequency (%) |
| ﹖ | 2 |
CJK Compat
| Value | Count | Frequency (%) |
| ㎡ | 1 |
Devanagari
| Value | Count | Frequency (%) |
| ट | 1 | |
| आ | 1 | |
| र | 1 | |
| ् | 1 | |
| य | 1 | |
| भ | 1 |
Enclosed Ideographic Sup
| Value | Count | Frequency (%) |
| 🉐 | 1 |
Mahjong
| Value | Count | Frequency (%) |
| 🀄 | 1 |
Egyptian Hieroglyphs
| Value | Count | Frequency (%) |
| 𓅃 | 1 | |
| 𓂀 | 1 | |
| 𓁁 | 1 |
PUA
| Value | Count | Frequency (%) |
| | 1 | |
| | 1 |
Geometric Shapes Ext
| Value | Count | Frequency (%) |
| 🟡 | 1 |
Domino
| Value | Count | Frequency (%) |
| 🀹 | 1 |
topicName
Categorical
| Distinct | 42 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 43.2 MiB |
| Business | |
|---|---|
| News | |
| Motivational | |
| Technology | |
| Design & Architecture | |
| Other values (37) |
Length
| Max length | 24 |
|---|---|
| Median length | 16 |
| Mean length | 8.6531054 |
| Min length | 3 |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Business |
|---|---|
| 2nd row | Business |
| 3rd row | Business |
| 4th row | Business |
| 5th row | Business |
Common Values
| Value | Count | Frequency (%) |
| Business | 164602 | |
| News | 131973 | |
| Motivational | 84750 | |
| Technology | 47679 | 6.1% |
| Design & Architecture | 44987 | 5.7% |
| Cryptocurrency | 38623 | 4.9% |
| Art | 36697 | 4.7% |
| Interesting | 28615 | 3.6% |
| Animal | 28202 | 3.6% |
| Memes | 26349 | 3.4% |
| Other values (32) | 153439 |
Length
| Value | Count | Frequency (%) |
| business | 164602 | |
| news | 132252 | |
| motivational | 84750 | 9.1% |
| 67473 | 7.2% | |
| technology | 47679 | 5.1% |
| design | 44987 | 4.8% |
| architecture | 44987 | 4.8% |
| cryptocurrency | 38623 | 4.1% |
| art | 36697 | 3.9% |
| interesting | 28615 | 3.1% |
| Other values (39) | 242297 |
Most occurring characters
| Value | Count | Frequency (%) |
| s | 758810 | 11.2% |
| e | 699171 | 10.3% |
| i | 520692 | 7.7% |
| n | 515001 | 7.6% |
| t | 478590 | 7.0% |
| o | 386155 | 5.7% |
| r | 378225 | 5.6% |
| a | 321111 | 4.7% |
| u | 290216 | 4.3% |
| c | 242622 | 3.6% |
| Other values (33) | 2210021 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 5675787 | |
| Uppercase Letter | 910308 | 13.4% |
| Space Separator | 147046 | 2.2% |
| Other Punctuation | 67473 | 1.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| s | 758810 | |
| e | 699171 | |
| i | 520692 | |
| n | 515001 | |
| t | 478590 | |
| o | 386155 | 6.8% |
| r | 378225 | 6.7% |
| a | 321111 | 5.7% |
| u | 290216 | 5.1% |
| c | 242622 | 4.3% |
| Other values (12) | 1085194 |
Uppercase Letter
| Value | Count | Frequency (%) |
| B | 164680 | |
| N | 158065 | |
| M | 114072 | |
| A | 109886 | |
| D | 67511 | |
| C | 61015 | 6.7% |
| T | 60894 | 6.7% |
| I | 51125 | 5.6% |
| P | 35247 | 3.9% |
| F | 25569 | 2.8% |
| Other values (9) | 62244 | 6.8% |
Space Separator
| Value | Count | Frequency (%) |
| 147046 |
Other Punctuation
| Value | Count | Frequency (%) |
| & | 67473 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 6586095 | |
| Common | 214519 | 3.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| s | 758810 | 11.5% |
| e | 699171 | 10.6% |
| i | 520692 | 7.9% |
| n | 515001 | 7.8% |
| t | 478590 | 7.3% |
| o | 386155 | 5.9% |
| r | 378225 | 5.7% |
| a | 321111 | 4.9% |
| u | 290216 | 4.4% |
| c | 242622 | 3.7% |
| Other values (31) | 1995502 |
Common
| Value | Count | Frequency (%) |
| 147046 | ||
| & | 67473 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6800614 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| s | 758810 | 11.2% |
| e | 699171 | 10.3% |
| i | 520692 | 7.7% |
| n | 515001 | 7.6% |
| t | 478590 | 7.0% |
| o | 386155 | 5.7% |
| r | 378225 | 5.6% |
| a | 321111 | 4.7% |
| u | 290216 | 4.3% |
| c | 242622 | 3.6% |
| Other values (33) | 2210021 |
usFlwrs
Real number (ℝ)
High correlation 
| Distinct | 343319 |
|---|---|
| Distinct (%) | 43.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4472701.3 |
| Minimum | 0 |
|---|---|
| Maximum | 1.0573845 × 108 |
| Zeros | 60 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.0 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 2472 |
| Q1 | 142628 |
| median | 966826.5 |
| Q3 | 3603135 |
| 95-th percentile | 20591725 |
| Maximum | 1.0573845 × 108 |
| Range | 1.0573845 × 108 |
| Interquartile range (IQR) | 3460507 |
Descriptive statistics
| Standard deviation | 9149778.1 |
|---|---|
| Coefficient of variation (CV) | 2.045694 |
| Kurtosis | 11.068883 |
| Mean | 4472701.3 |
| Median Absolute Deviation (MAD) | 961332.5 |
| Skewness | 3.2684972 |
| Sum | 3.5151675 × 1012 |
| Variance | 8.371844 × 1013 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 590529 | 234 | < 0.1% |
| 16262182 | 217 | < 0.1% |
| 589918 | 211 | < 0.1% |
| 2150418 | 191 | < 0.1% |
| 2133253 | 187 | < 0.1% |
| 62 | 181 | < 0.1% |
| 590441 | 162 | < 0.1% |
| 590390 | 150 | < 0.1% |
| 16235969 | 148 | < 0.1% |
| 65 | 147 | < 0.1% |
| Other values (343309) | 784088 |
| Value | Count | Frequency (%) |
| 0 | 60 | |
| 1 | 49 | |
| 2 | 59 | |
| 3 | 53 | |
| 4 | 61 | |
| 5 | 59 | |
| 6 | 67 | |
| 7 | 51 | |
| 8 | 59 | |
| 9 | 66 |
| Value | Count | Frequency (%) |
| 105738450 | 1 | |
| 105246986 | 1 | |
| 105246692 | 1 | |
| 105245851 | 1 | |
| 105017221 | 1 | |
| 103922950 | 1 | |
| 103873045 | 1 | |
| 103873020 | 2 | |
| 103744375 | 1 | |
| 103670138 | 2 |
usID
Real number (ℝ)
High correlation 
| Distinct | 22516 |
|---|---|
| Distinct (%) | 2.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.0852761 × 1017 |
| Minimum | 12 |
|---|---|
| Maximum | 1.153467 × 1018 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.0 MiB |
Quantile statistics
| Minimum | 12 |
|---|---|
| 5-th percentile | 1652541 |
| Q1 | 15513767 |
| median | 36184220 |
| Q3 | 9.545908 × 108 |
| 95-th percentile | 9.4275497 × 1017 |
| Maximum | 1.153467 × 1018 |
| Range | 1.153467 × 1018 |
| Interquartile range (IQR) | 9.3907704 × 108 |
Descriptive statistics
| Standard deviation | 3.0104857 × 1017 |
|---|---|
| Coefficient of variation (CV) | 2.7739352 |
| Kurtosis | 4.195324 |
| Mean | 1.0852761 × 1017 |
| Median Absolute Deviation (MAD) | 35377125 |
| Skewness | 2.460755 |
| Sum | -4.1567913 × 1018 |
| Variance | 9.063024 × 1034 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 34713362 | 62033 | 7.9% |
| 200583835 | 37446 | 4.8% |
| 20562637 | 36255 | 4.6% |
| 14763734 | 22131 | 2.8% |
| 3108351 | 20364 | 2.6% |
| 807095 | 16862 | 2.1% |
| 14511951 | 15798 | 2.0% |
| 1652541 | 14897 | 1.9% |
| 701725963 | 14727 | 1.9% |
| 16896485 | 14412 | 1.8% |
| Other values (22506) | 530991 |
| Value | Count | Frequency (%) |
| 12 | 8 | |
| 62 | 1 | < 0.1% |
| 767 | 7 | |
| 1585 | 3 | < 0.1% |
| 1605 | 3 | < 0.1% |
| 3475 | 1 | < 0.1% |
| 4816 | 2 | < 0.1% |
| 7846 | 1 | < 0.1% |
| 10221 | 1 | < 0.1% |
| 10437 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 1.153466973 × 1018 | 1 | < 0.1% |
| 1.152826439 × 1018 | 1 | < 0.1% |
| 1.152685127 × 1018 | 1 | < 0.1% |
| 1.152301169 × 1018 | 1 | < 0.1% |
| 1.152284678 × 1018 | 2 | < 0.1% |
| 1.151881037 × 1018 | 1 | < 0.1% |
| 1.15169615 × 1018 | 1 | < 0.1% |
| 1.151607491 × 1018 | 1 | < 0.1% |
| 1.151507737 × 1018 | 16 | |
| 1.151473541 × 1018 | 1 | < 0.1% |
usName
Text
| Distinct | 22978 |
|---|---|
| Distinct (%) | 2.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 51.0 MiB |
Length
| Max length | 50 |
|---|---|
| Median length | 44 |
| Mean length | 12.720442 |
| Min length | 1 |
Unique
| Unique | 12921 ? |
|---|---|
| Unique (%) | 1.6% |
Sample
| 1st row | The Economist |
|---|---|
| 2nd row | CNN Business |
| 3rd row | FORTUNE |
| 4th row | Business Standard |
| 5th row | Reuters Business |
| Value | Count | Frequency (%) |
| bloomberg | 66759 | 4.2% |
| the | 63271 | 4.0% |
| business | 53561 | 3.4% |
| insider | 42829 | 2.7% |
| tim | 37593 | 2.4% |
| fargo | 37447 | 2.4% |
| news | 32959 | 2.1% |
| reuters | 28544 | 1.8% |
| quotes | 25315 | 1.6% |
| digital | 22231 | 1.4% |
| Other values (21870) | 1179163 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 951284 | 9.5% |
| 804352 | 8.0% | |
| o | 617337 | 6.2% |
| i | 606223 | 6.1% |
| r | 604599 | 6.0% |
| s | 600581 | 6.0% |
| a | 512714 | 5.1% |
| n | 499289 | 5.0% |
| t | 436832 | 4.4% |
| l | 383455 | 3.8% |
| Other values (1813) | 3980533 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 7168105 | |
| Uppercase Letter | 1821774 | 18.2% |
| Space Separator | 804423 | 8.0% |
| Other Symbol | 85207 | 0.9% |
| Other Punctuation | 58912 | 0.6% |
| Decimal Number | 19252 | 0.2% |
| Nonspacing Mark | 11467 | 0.1% |
| Connector Punctuation | 8458 | 0.1% |
| Dash Punctuation | 5221 | 0.1% |
| Close Punctuation | 2981 | < 0.1% |
| Other values (13) | 11399 | 0.1% |
Most frequent character per category
Other Symbol
| Value | Count | Frequency (%) |
| 🔥 | 15405 | |
| ❤ | 11818 | 13.9% |
| ☘ | 4752 | 5.6% |
| ❄ | 4437 | 5.2% |
| 🤩 | 4110 | 4.8% |
| ✨ | 2874 | 3.4% |
| ™ | 2362 | 2.8% |
| 🚀 | 2314 | 2.7% |
| 🔞 | 2183 | 2.6% |
| 📽 | 2134 | 2.5% |
| Other values (646) | 32818 |
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 951284 | |
| o | 617337 | 8.6% |
| i | 606223 | 8.5% |
| r | 604599 | 8.4% |
| s | 600581 | 8.4% |
| a | 512714 | 7.2% |
| n | 499289 | 7.0% |
| t | 436832 | 6.1% |
| l | 383455 | 5.3% |
| u | 278335 | 3.9% |
| Other values (426) | 1677456 |
Other Letter
| Value | Count | Frequency (%) |
| ॐ | 1243 | |
| ㅤ | 143 | 5.0% |
| 者 | 103 | 3.6% |
| 勝 | 103 | 3.6% |
| ا | 94 | 3.3% |
| 火 | 75 | 2.6% |
| ر | 46 | 1.6% |
| 𓂀 | 35 | 1.2% |
| ن | 33 | 1.2% |
| ي | 33 | 1.2% |
| Other values (293) | 956 |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 249121 | |
| B | 189526 | 10.4% |
| N | 155423 | 8.5% |
| I | 114118 | 6.3% |
| C | 109033 | 6.0% |
| S | 99600 | 5.5% |
| A | 99120 | 5.4% |
| M | 90357 | 5.0% |
| F | 82935 | 4.6% |
| P | 74428 | 4.1% |
| Other values (216) | 558113 |
Modifier Letter
| Value | Count | Frequency (%) |
| ᵗ | 32 | |
| ᵛ | 29 | |
| ˏ | 11 | 5.9% |
| ˊ | 11 | 5.9% |
| ˎ | 11 | 5.9% |
| ˋ | 11 | 5.9% |
| ʸ | 8 | 4.3% |
| ゚ | 8 | 4.3% |
| ᵒ | 5 | 2.7% |
| ˡ | 5 | 2.7% |
| Other values (31) | 54 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 27547 | |
| & | 10921 | 18.5% |
| # | 7889 | 13.4% |
| ' | 4041 | 6.9% |
| @ | 2121 | 3.6% |
| * | 1706 | 2.9% |
| , | 1319 | 2.2% |
| ! | 1308 | 2.2% |
| : | 580 | 1.0% |
| / | 536 | 0.9% |
| Other values (19) | 944 | 1.6% |
Nonspacing Mark
| Value | Count | Frequency (%) |
| ️ | 10961 | |
| ً | 266 | 2.3% |
| َ | 106 | 0.9% |
| ྂ | 57 | 0.5% |
| ໊ | 26 | 0.2% |
| ︎ | 14 | 0.1% |
| ं | 4 | < 0.1% |
| ् | 4 | < 0.1% |
| ͡ | 4 | < 0.1% |
| ᷈ | 4 | < 0.1% |
| Other values (16) | 21 | 0.2% |
Format
| Value | Count | Frequency (%) |
| | 267 | |
| | 177 | |
| | 68 | 10.3% |
| | 28 | 4.2% |
| | 24 | 3.6% |
| | 24 | 3.6% |
| | 20 | 3.0% |
| | 12 | 1.8% |
| | 12 | 1.8% |
| | 8 | 1.2% |
| Other values (7) | 23 | 3.5% |
Decimal Number
| Value | Count | Frequency (%) |
| 9 | 7048 | |
| 1 | 4902 | |
| 5 | 1991 | 10.3% |
| 0 | 1809 | 9.4% |
| 7 | 1116 | 5.8% |
| 2 | 641 | 3.3% |
| 6 | 444 | 2.3% |
| 𝟤 | 416 | 2.2% |
| 8 | 306 | 1.6% |
| 3 | 300 | 1.6% |
| Other values (5) | 279 | 1.4% |
Math Symbol
| Value | Count | Frequency (%) |
| + | 2223 | |
| | | 155 | 6.1% |
| ~ | 74 | 2.9% |
| = | 45 | 1.8% |
| ∂ | 21 | 0.8% |
| > | 4 | 0.2% |
| < | 4 | 0.2% |
| ⊕ | 2 | 0.1% |
| ⧖ | 2 | 0.1% |
| ∎ | 2 | 0.1% |
| Other values (3) | 3 | 0.1% |
Currency Symbol
| Value | Count | Frequency (%) |
| ₿ | 895 | |
| ¢ | 282 | 19.3% |
| $ | 184 | 12.6% |
| ฿ | 89 | 6.1% |
| ₳ | 3 | 0.2% |
| ₦ | 2 | 0.1% |
| ₥ | 1 | 0.1% |
| ₲ | 1 | 0.1% |
| ₣ | 1 | 0.1% |
| ₡ | 1 | 0.1% |
| Other values (2) | 2 | 0.1% |
Modifier Symbol
| Value | Count | Frequency (%) |
| 🏻 | 81 | |
| 🏼 | 60 | |
| 🏽 | 27 | 11.8% |
| ˗ | 22 | 9.6% |
| 🏾 | 14 | 6.1% |
| 🏿 | 9 | 3.9% |
| ` | 6 | 2.6% |
| ^ | 3 | 1.3% |
| ¨ | 2 | 0.9% |
| ¯ | 2 | 0.9% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 2323 | |
| [ | 617 | 20.9% |
| { | 4 | 0.1% |
| ༺ | 3 | 0.1% |
| „ | 2 | 0.1% |
| ︽ | 2 | 0.1% |
| ( | 1 | < 0.1% |
| 【 | 1 | < 0.1% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 2355 | |
| ] | 617 | 20.7% |
| } | 4 | 0.1% |
| ༻ | 3 | 0.1% |
| ) | 1 | < 0.1% |
| 】 | 1 | < 0.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 5151 | |
| 〰 | 62 | 1.2% |
| — | 4 | 0.1% |
| – | 3 | 0.1% |
| 〜 | 1 | < 0.1% |
Spacing Mark
| Value | Count | Frequency (%) |
| ा | 8 | |
| ि | 2 | 15.4% |
| េ | 2 | 15.4% |
| ា | 1 | 7.7% |
Final Punctuation
| Value | Count | Frequency (%) |
| ’ | 433 | |
| ” | 13 | 2.9% |
| » | 2 | 0.4% |
Initial Punctuation
| Value | Count | Frequency (%) |
| “ | 22 | |
| ‘ | 3 | 11.1% |
| « | 2 | 7.4% |
Other Number
| Value | Count | Frequency (%) |
| ² | 3 | |
| ¹ | 1 | 20.0% |
| ➐ | 1 | 20.0% |
Space Separator
| Value | Count | Frequency (%) |
| 804352 | ||
| 71 | < 0.1% |
Private Use
| Value | Count | Frequency (%) |
| | 10 | |
| | 5 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 8458 |
Enclosing Mark
| Value | Count | Frequency (%) |
| ⃣ | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8971753 | |
| Common | 1006918 | 10.1% |
| Inherited | 11700 | 0.1% |
| Cyrillic | 1631 | < 0.1% |
| Devanagari | 1305 | < 0.1% |
| Georgian | 1199 | < 0.1% |
| Greek | 876 | < 0.1% |
| Han | 681 | < 0.1% |
| Arabic | 428 | < 0.1% |
| Hangul | 218 | < 0.1% |
| Other values (15) | 490 | < 0.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 804352 | ||
| . | 27547 | 2.7% |
| 🔥 | 15405 | 1.5% |
| ❤ | 11818 | 1.2% |
| & | 10921 | 1.1% |
| _ | 8458 | 0.8% |
| # | 7889 | 0.8% |
| 9 | 7048 | 0.7% |
| - | 5151 | 0.5% |
| 1 | 4902 | 0.5% |
| Other values (1110) | 103427 | 10.3% |
Latin
| Value | Count | Frequency (%) |
| e | 951284 | 10.6% |
| o | 617337 | 6.9% |
| i | 606223 | 6.8% |
| r | 604599 | 6.7% |
| s | 600581 | 6.7% |
| a | 512714 | 5.7% |
| n | 499289 | 5.6% |
| t | 436832 | 4.9% |
| l | 383455 | 4.3% |
| u | 278335 | 3.1% |
| Other values (237) | 3481104 |
Han
| Value | Count | Frequency (%) |
| 者 | 103 | 15.1% |
| 勝 | 103 | 15.1% |
| 火 | 75 | 11.0% |
| 䂀 | 28 | 4.1% |
| 曼 | 28 | 4.1% |
| 劉 | 28 | 4.1% |
| 斯 | 14 | 2.1% |
| 大 | 10 | 1.5% |
| 丅 | 9 | 1.3% |
| 英 | 8 | 1.2% |
| Other values (110) | 275 |
Cyrillic
| Value | Count | Frequency (%) |
| ѕ | 394 | |
| є | 378 | |
| и | 176 | |
| я | 172 | |
| у | 146 | 9.0% |
| т | 83 | 5.1% |
| н | 26 | 1.6% |
| а | 23 | 1.4% |
| ҽ | 21 | 1.3% |
| С | 15 | 0.9% |
| Other values (41) | 197 |
Greek
| Value | Count | Frequency (%) |
| α | 227 | |
| σ | 201 | |
| ι | 190 | |
| ρ | 81 | 9.2% |
| υ | 36 | 4.1% |
| ο | 11 | 1.3% |
| ν | 11 | 1.3% |
| Σ | 9 | 1.0% |
| ω | 8 | 0.9% |
| Λ | 8 | 0.9% |
| Other values (32) | 94 |
Hangul
| Value | Count | Frequency (%) |
| ㅤ | 143 | |
| ㅏ | 10 | 4.6% |
| ㄹ | 6 | 2.8% |
| 송 | 5 | 2.3% |
| 재 | 4 | 1.8% |
| 준 | 4 | 1.8% |
| ㄴ | 4 | 1.8% |
| 소 | 3 | 1.4% |
| 단 | 3 | 1.4% |
| 방 | 3 | 1.4% |
| Other values (25) | 33 | 15.1% |
Arabic
| Value | Count | Frequency (%) |
| ا | 94 | |
| ر | 46 | |
| ن | 33 | 7.7% |
| ي | 33 | 7.7% |
| م | 31 | 7.2% |
| ل | 30 | 7.0% |
| ع | 23 | 5.4% |
| ف | 23 | 5.4% |
| س | 21 | 4.9% |
| ب | 18 | 4.2% |
| Other values (22) | 76 |
Katakana
| Value | Count | Frequency (%) |
| ツ | 11 | |
| ッ | 9 | |
| ニ | 3 | 5.8% |
| ダ | 2 | 3.8% |
| エ | 2 | 3.8% |
| ル | 2 | 3.8% |
| ン | 2 | 3.8% |
| レ | 2 | 3.8% |
| ミ | 1 | 1.9% |
| ャ | 1 | 1.9% |
| Other values (17) | 17 |
Devanagari
| Value | Count | Frequency (%) |
| ॐ | 1243 | |
| ा | 8 | 0.6% |
| न | 8 | 0.6% |
| ं | 4 | 0.3% |
| ् | 4 | 0.3% |
| य | 4 | 0.3% |
| ओ | 4 | 0.3% |
| ल | 2 | 0.2% |
| द | 2 | 0.2% |
| े | 2 | 0.2% |
| Other values (12) | 24 | 1.8% |
Thai
| Value | Count | Frequency (%) |
| า | 4 | 12.1% |
| ง | 4 | 12.1% |
| ๑ | 4 | 12.1% |
| น | 3 | 9.1% |
| ่ | 2 | 6.1% |
| ป | 2 | 6.1% |
| ร | 1 | 3.0% |
| ้ | 1 | 3.0% |
| อ | 1 | 3.0% |
| เ | 1 | 3.0% |
| Other values (10) | 10 |
Hiragana
| Value | Count | Frequency (%) |
| こ | 7 | |
| っ | 6 | |
| は | 6 | |
| じ | 6 | |
| い | 2 | 4.4% |
| の | 2 | 4.4% |
| つ | 2 | 4.4% |
| ば | 2 | 4.4% |
| さ | 2 | 4.4% |
| ろ | 2 | 4.4% |
| Other values (8) | 8 |
Hebrew
| Value | Count | Frequency (%) |
| ן | 5 | |
| ס | 2 | 8.3% |
| ם | 2 | 8.3% |
| ה | 2 | 8.3% |
| א | 1 | 4.2% |
| ח | 1 | 4.2% |
| ת | 1 | 4.2% |
| ד | 1 | 4.2% |
| ע | 1 | 4.2% |
| מ | 1 | 4.2% |
| Other values (7) | 7 |
Canadian_Aboriginal
| Value | Count | Frequency (%) |
| ᑎ | 12 | |
| ᗩ | 12 | |
| ᗴ | 9 | |
| ᖇ | 7 | |
| ᑌ | 4 | 6.1% |
| ᒪ | 3 | 4.5% |
| ᗪ | 3 | 4.5% |
| ᗰ | 3 | 4.5% |
| ᔕ | 3 | 4.5% |
| ᖶ | 2 | 3.0% |
| Other values (6) | 8 |
Inherited
| Value | Count | Frequency (%) |
| ️ | 10961 | |
| | 267 | 2.3% |
| ً | 266 | 2.3% |
| َ | 106 | 0.9% |
| | 68 | 0.6% |
| ︎ | 14 | 0.1% |
| ͡ | 4 | < 0.1% |
| ᷈ | 4 | < 0.1% |
| ͜ | 2 | < 0.1% |
| ̮ | 2 | < 0.1% |
| Other values (5) | 6 | 0.1% |
Khmer
| Value | Count | Frequency (%) |
| ឌ | 2 | |
| េ | 2 | |
| ម | 1 | |
| ប | 1 | |
| ូ | 1 | |
| ា | 1 | |
| ដ | 1 | |
| ល | 1 | |
| ី | 1 | |
| ឹ | 1 |
Armenian
| Value | Count | Frequency (%) |
| Ե | 39 | |
| օ | 18 | |
| վ | 13 | 13.3% |
| ղ | 11 | 11.2% |
| մ | 6 | 6.1% |
| Թ | 4 | 4.1% |
| հ | 4 | 4.1% |
| ց | 1 | 1.0% |
| ռ | 1 | 1.0% |
| ֆ | 1 | 1.0% |
Cherokee
| Value | Count | Frequency (%) |
| Ꭷ | 6 | |
| Ꮆ | 2 | 18.2% |
| Ꭿ | 1 | 9.1% |
| Ꮗ | 1 | 9.1% |
| Ꮙ | 1 | 9.1% |
Georgian
| Value | Count | Frequency (%) |
| ღ | 1192 | |
| ყ | 4 | 0.3% |
| ძ | 2 | 0.2% |
| ხ | 1 | 0.1% |
Tibetan
| Value | Count | Frequency (%) |
| ྂ | 57 | |
| ༺ | 3 | 4.8% |
| ༻ | 3 | 4.8% |
Unknown
| Value | Count | Frequency (%) |
| | 10 | |
| | 5 |
Gujarati
| Value | Count | Frequency (%) |
| ૯ | 6 | |
| ૐ | 1 | 14.3% |
Egyptian_Hieroglyphs
| Value | Count | Frequency (%) |
| 𓂀 | 35 |
Lao
| Value | Count | Frequency (%) |
| ໊ | 26 |
Tifinagh
| Value | Count | Frequency (%) |
| ⵜ | 1 |
Bopomofo
| Value | Count | Frequency (%) |
| ㄥ | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9870911 | |
| None | 55352 | 0.6% |
| Dingbats | 19989 | 0.2% |
| Math Alphanum | 14669 | 0.1% |
| VS | 10975 | 0.1% |
| Misc Symbols | 10344 | 0.1% |
| Letterlike Symbols | 2555 | < 0.1% |
| Enclosed Alphanum Sup | 1956 | < 0.1% |
| Cyrillic | 1625 | < 0.1% |
| Devanagari | 1305 | < 0.1% |
| Other values (43) | 7518 | 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 951284 | 9.6% |
| 804352 | 8.1% | |
| o | 617337 | 6.3% |
| i | 606223 | 6.1% |
| r | 604599 | 6.1% |
| s | 600581 | 6.1% |
| a | 512714 | 5.2% |
| n | 499289 | 5.1% |
| t | 436832 | 4.4% |
| l | 383455 | 3.9% |
| Other values (83) | 3854245 |
None
| Value | Count | Frequency (%) |
| 🔥 | 15405 | |
| 🤩 | 4110 | 7.4% |
| 🚀 | 2314 | 4.2% |
| 🔞 | 2183 | 3.9% |
| 📽 | 2134 | 3.9% |
| ® | 2071 | 3.7% |
| ë | 1590 | 2.9% |
| 📈 | 1584 | 2.9% |
| 📭 | 1230 | 2.2% |
| 🗺 | 1146 | 2.1% |
| Other values (621) | 21585 |
Dingbats
| Value | Count | Frequency (%) |
| ❤ | 11818 | |
| ❄ | 4437 | 22.2% |
| ✨ | 2874 | 14.4% |
| ✌ | 239 | 1.2% |
| ✈ | 174 | 0.9% |
| ➶ | 91 | 0.5% |
| ❇ | 75 | 0.4% |
| ✒ | 32 | 0.2% |
| ✍ | 26 | 0.1% |
| ❓ | 26 | 0.1% |
| Other values (30) | 197 | 1.0% |
VS
| Value | Count | Frequency (%) |
| ️ | 10961 | |
| ︎ | 14 | 0.1% |
Misc Symbols
| Value | Count | Frequency (%) |
| ☘ | 4752 | |
| ⛩ | 2087 | |
| ♡ | 984 | 9.5% |
| ⚔ | 453 | 4.4% |
| ☄ | 382 | 3.7% |
| ⚡ | 248 | 2.4% |
| ☀ | 237 | 2.3% |
| ♤ | 208 | 2.0% |
| ⛵ | 95 | 0.9% |
| ★ | 91 | 0.9% |
| Other values (67) | 807 | 7.8% |
Letterlike Symbols
| Value | Count | Frequency (%) |
| ™ | 2362 | |
| ℓ | 159 | 6.2% |
| ℂ | 12 | 0.5% |
| ℕ | 8 | 0.3% |
| ℬ | 4 | 0.2% |
| ℰ | 2 | 0.1% |
| ℍ | 2 | 0.1% |
| ℳ | 1 | < 0.1% |
| ℭ | 1 | < 0.1% |
| ℝ | 1 | < 0.1% |
| Other values (3) | 3 | 0.1% |
Devanagari
| Value | Count | Frequency (%) |
| ॐ | 1243 | |
| ा | 8 | 0.6% |
| न | 8 | 0.6% |
| ं | 4 | 0.3% |
| ् | 4 | 0.3% |
| य | 4 | 0.3% |
| ओ | 4 | 0.3% |
| ल | 2 | 0.2% |
| द | 2 | 0.2% |
| े | 2 | 0.2% |
| Other values (12) | 24 | 1.8% |
Georgian
| Value | Count | Frequency (%) |
| ღ | 1192 | |
| ყ | 4 | 0.3% |
| ძ | 2 | 0.2% |
| ხ | 1 | 0.1% |
Math Alphanum
| Value | Count | Frequency (%) |
| 𝚎 | 992 | 6.8% |
| 𝖾 | 833 | 5.7% |
| 𝗌 | 832 | 5.7% |
| 𝚕 | 489 | 3.3% |
| 𝗋 | 417 | 2.8% |
| 𝗅 | 417 | 2.8% |
| 𝗈 | 417 | 2.8% |
| 𝖢 | 417 | 2.8% |
| 𝗏 | 416 | 2.8% |
| 𝟤 | 416 | 2.8% |
| Other values (318) | 9023 |
Currency Symbols
| Value | Count | Frequency (%) |
| ₿ | 895 | |
| ₳ | 3 | 0.3% |
| ₦ | 2 | 0.2% |
| ₥ | 1 | 0.1% |
| ₲ | 1 | 0.1% |
| ₣ | 1 | 0.1% |
| ₡ | 1 | 0.1% |
| ₪ | 1 | 0.1% |
| ₱ | 1 | 0.1% |
Punctuation
| Value | Count | Frequency (%) |
| ’ | 433 | |
| | 267 | |
| | 177 | |
| • | 84 | 7.7% |
| | 68 | 6.2% |
| “ | 22 | 2.0% |
| ” | 13 | 1.2% |
| | 4 | 0.4% |
| — | 4 | 0.4% |
| † | 4 | 0.4% |
| Other values (8) | 13 | 1.2% |
Cyrillic
| Value | Count | Frequency (%) |
| ѕ | 394 | |
| є | 378 | |
| и | 176 | |
| я | 172 | |
| у | 146 | 9.0% |
| т | 83 | 5.1% |
| н | 26 | 1.6% |
| а | 23 | 1.4% |
| ҽ | 21 | 1.3% |
| С | 15 | 0.9% |
| Other values (36) | 191 |
Enclosed Alphanum Sup
| Value | Count | Frequency (%) |
| 🇺 | 291 | |
| 🇵 | 223 | |
| 🇷 | 177 | |
| 🇪 | 169 | |
| 🇸 | 163 | 8.3% |
| 🇦 | 133 | 6.8% |
| 🇧 | 119 | 6.1% |
| 🇨 | 109 | 5.6% |
| 🇬 | 101 | 5.2% |
| 🇮 | 63 | 3.2% |
| Other values (29) | 408 |
Arabic
| Value | Count | Frequency (%) |
| ً | 266 | |
| َ | 106 | 13.2% |
| ا | 94 | 11.7% |
| ر | 46 | 5.7% |
| ن | 33 | 4.1% |
| ي | 33 | 4.1% |
| م | 31 | 3.9% |
| ل | 30 | 3.7% |
| ع | 23 | 2.9% |
| ف | 23 | 2.9% |
| Other values (26) | 117 |
Compat Jamo
| Value | Count | Frequency (%) |
| ㅤ | 143 | |
| ㅏ | 10 | 6.1% |
| ㄹ | 6 | 3.7% |
| ㄴ | 4 | 2.5% |
Emoticons
| Value | Count | Frequency (%) |
| 😎 | 135 | |
| 😷 | 127 | |
| 😈 | 47 | 8.9% |
| 😘 | 31 | 5.9% |
| 😁 | 25 | 4.7% |
| 😃 | 20 | 3.8% |
| 😻 | 19 | 3.6% |
| 😮 | 13 | 2.5% |
| 😊 | 12 | 2.3% |
| 🙋 | 12 | 2.3% |
| Other values (21) | 86 |
Phonetic Ext
| Value | Count | Frequency (%) |
| ᴀ | 120 | |
| ᴇ | 77 | |
| ᴛ | 62 | |
| ᴜ | 35 | 6.6% |
| ᵗ | 32 | 6.0% |
| ᵛ | 29 | 5.5% |
| ᴠ | 28 | 5.3% |
| ᴡ | 22 | 4.1% |
| ᴊ | 22 | 4.1% |
| ᴄ | 19 | 3.6% |
| Other values (28) | 85 |
Specials
| Value | Count | Frequency (%) |
|  | 104 |
CJK
| Value | Count | Frequency (%) |
| 者 | 103 | 15.8% |
| 勝 | 103 | 15.8% |
| 火 | 75 | 11.5% |
| 曼 | 28 | 4.3% |
| 劉 | 28 | 4.3% |
| 斯 | 14 | 2.1% |
| 大 | 10 | 1.5% |
| 丅 | 9 | 1.4% |
| 英 | 8 | 1.2% |
| 若 | 8 | 1.2% |
| Other values (109) | 267 |
IPA Ext
| Value | Count | Frequency (%) |
| ɴ | 92 | |
| ɪ | 54 | |
| ɢ | 48 | |
| ɛ | 42 | |
| ɑ | 39 | |
| ʏ | 37 | |
| ʀ | 32 | 6.9% |
| ɾ | 25 | 5.4% |
| ʜ | 18 | 3.9% |
| ʟ | 17 | 3.6% |
| Other values (19) | 62 |
Thai
| Value | Count | Frequency (%) |
| ฿ | 89 | |
| า | 4 | 3.3% |
| ง | 4 | 3.3% |
| ๑ | 4 | 3.3% |
| น | 3 | 2.5% |
| ่ | 2 | 1.6% |
| ป | 2 | 1.6% |
| ร | 1 | 0.8% |
| ้ | 1 | 0.8% |
| อ | 1 | 0.8% |
| Other values (11) | 11 | 9.0% |
Tibetan
| Value | Count | Frequency (%) |
| ྂ | 57 | |
| ༺ | 3 | 4.8% |
| ༻ | 3 | 4.8% |
Armenian
| Value | Count | Frequency (%) |
| Ե | 39 | |
| օ | 18 | |
| վ | 13 | 13.3% |
| ղ | 11 | 11.2% |
| մ | 6 | 6.1% |
| Թ | 4 | 4.1% |
| հ | 4 | 4.1% |
| ց | 1 | 1.0% |
| ռ | 1 | 1.0% |
| ֆ | 1 | 1.0% |
Egyptian Hieroglyphs
| Value | Count | Frequency (%) |
| 𓂀 | 35 |
Enclosed Alphanum
| Value | Count | Frequency (%) |
| Ⓥ | 34 | |
| ⓡ | 2 | 3.8% |
| Ⓒ | 2 | 3.8% |
| ⓙ | 2 | 3.8% |
| ⓣ | 2 | 3.8% |
| Ⓑ | 2 | 3.8% |
| Ⓜ | 1 | 1.9% |
| ⓝ | 1 | 1.9% |
| ⓓ | 1 | 1.9% |
| ⓔ | 1 | 1.9% |
| Other values (4) | 4 | 7.7% |
CJK Ext A
| Value | Count | Frequency (%) |
| 䂀 | 28 |
Tags
| Value | Count | Frequency (%) |
| | 28 | |
| | 24 | |
| | 24 | |
| | 20 | |
| | 12 | |
| | 12 | |
| | 8 | 5.6% |
| | 8 | 5.6% |
| | 4 | 2.8% |
| | 4 | 2.8% |
Lao
| Value | Count | Frequency (%) |
| ໊ | 26 |
Modifier Letters
| Value | Count | Frequency (%) |
| ˗ | 22 | |
| ˏ | 11 | |
| ˊ | 11 | |
| ˎ | 11 | |
| ˋ | 11 | |
| ʸ | 8 | 9.3% |
| ˡ | 5 | 5.8% |
| ʰ | 2 | 2.3% |
| ˵ | 2 | 2.3% |
| ˢ | 1 | 1.2% |
| Other values (2) | 2 | 2.3% |
Math Operators
| Value | Count | Frequency (%) |
| ∂ | 21 | |
| ⊕ | 2 | 7.4% |
| ∎ | 2 | 7.4% |
| ∆ | 1 | 3.7% |
| ∞ | 1 | 3.7% |
Katakana
| Value | Count | Frequency (%) |
| ・ | 16 | |
| ツ | 11 | |
| ッ | 9 | |
| ニ | 3 | 4.5% |
| ー | 2 | 3.0% |
| ダ | 2 | 3.0% |
| エ | 2 | 3.0% |
| ル | 2 | 3.0% |
| ン | 2 | 3.0% |
| ミ | 1 | 1.5% |
| Other values (16) | 16 |
Geometric Shapes
| Value | Count | Frequency (%) |
| ◇ | 16 | |
| ○ | 2 | 8.3% |
| ▪ | 2 | 8.3% |
| ◕ | 2 | 8.3% |
| ▲ | 1 | 4.2% |
| ◾ | 1 | 4.2% |
Misc Technical
| Value | Count | Frequency (%) |
| ⏳ | 15 | |
| ⌛ | 1 | 5.9% |
| ⌨ | 1 | 5.9% |
UCAS
| Value | Count | Frequency (%) |
| ᑎ | 12 | |
| ᗩ | 12 | |
| ᗴ | 9 | |
| ᖇ | 7 | |
| ᑌ | 4 | 6.1% |
| ᒪ | 3 | 4.5% |
| ᗪ | 3 | 4.5% |
| ᗰ | 3 | 4.5% |
| ᔕ | 3 | 4.5% |
| ᖶ | 2 | 3.0% |
| Other values (6) | 8 |
PUA
| Value | Count | Frequency (%) |
| | 10 | |
| | 5 |
Hiragana
| Value | Count | Frequency (%) |
| こ | 7 | |
| っ | 6 | |
| は | 6 | |
| じ | 6 | |
| い | 2 | 4.4% |
| の | 2 | 4.4% |
| つ | 2 | 4.4% |
| ば | 2 | 4.4% |
| さ | 2 | 4.4% |
| ろ | 2 | 4.4% |
| Other values (8) | 8 |
Gujarati
| Value | Count | Frequency (%) |
| ૯ | 6 | |
| ૐ | 1 | 14.3% |
Cherokee
| Value | Count | Frequency (%) |
| Ꭷ | 6 | |
| Ꮆ | 2 | 18.2% |
| Ꭿ | 1 | 9.1% |
| Ꮗ | 1 | 9.1% |
| Ꮙ | 1 | 9.1% |
Latin Ext Additional
| Value | Count | Frequency (%) |
| ḳ | 5 | |
| Ẓ | 1 | 14.3% |
| ṅ | 1 | 14.3% |
Greek Ext
| Value | Count | Frequency (%) |
| ή | 5 |
Hebrew
| Value | Count | Frequency (%) |
| ן | 5 | |
| ס | 2 | 8.3% |
| ם | 2 | 8.3% |
| ה | 2 | 8.3% |
| א | 1 | 4.2% |
| ח | 1 | 4.2% |
| ת | 1 | 4.2% |
| ד | 1 | 4.2% |
| ע | 1 | 4.2% |
| מ | 1 | 4.2% |
| Other values (7) | 7 |
Hangul
| Value | Count | Frequency (%) |
| 송 | 5 | 9.3% |
| 재 | 4 | 7.4% |
| 준 | 4 | 7.4% |
| 소 | 3 | 5.6% |
| 단 | 3 | 5.6% |
| 방 | 3 | 5.6% |
| 탄 | 3 | 5.6% |
| 년 | 3 | 5.6% |
| 이 | 2 | 3.7% |
| 지 | 2 | 3.7% |
| Other values (20) | 22 |
Diacriticals
| Value | Count | Frequency (%) |
| ͡ | 4 | |
| ͜ | 2 | |
| ̮ | 2 | |
| ̷ | 1 | 10.0% |
| ́ | 1 | 10.0% |
Diacriticals Sup
| Value | Count | Frequency (%) |
| ᷈ | 4 |
Khmer
| Value | Count | Frequency (%) |
| ឌ | 2 | |
| េ | 2 | |
| ម | 1 | |
| ប | 1 | |
| ូ | 1 | |
| ា | 1 | |
| ដ | 1 | |
| ល | 1 | |
| ី | 1 | |
| ឹ | 1 |
Enclosed Ideographic Sup
| Value | Count | Frequency (%) |
| 🈴 | 2 | |
| 🈸 | 2 | |
| 🈷 | 2 | |
| 🈯 | 2 | |
| 🈶 | 2 | |
| 🈳 | 2 | |
| 🈹 | 2 |
Phonetic Ext Sup
| Value | Count | Frequency (%) |
| ᶻ | 2 | |
| ᶜ | 1 | |
| ᶠ | 1 |
CJK Compat Forms
| Value | Count | Frequency (%) |
| ︽ | 2 |
Cyrillic Sup
| Value | Count | Frequency (%) |
| ԋ | 2 | |
| ԃ | 1 | |
| Ԁ | 1 |
Jamo
| Value | Count | Frequency (%) |
| ᄂ | 1 |
Tifinagh
| Value | Count | Frequency (%) |
| ⵜ | 1 |
Bopomofo
| Value | Count | Frequency (%) |
| ㄥ | 1 |
Block Elements
| Value | Count | Frequency (%) |
| █ | 1 | |
| ▒ | 1 |
videoUrl
URL
Missing 
| Distinct | 77506 |
|---|---|
| Distinct (%) | 55.2% |
| Missing | 645425 |
| Missing (%) | 82.1% |
| Memory size | 38.9 MiB |
| https://video.twimg.com/amplify_video/1098673630966358016/vid/640x360/s-QArpal8Y5g-1Co.mp4?tag=9 | 127 |
|---|---|
| https://video.twimg.com/amplify_video/1126626046558625792/pl/JEInfD21OO9wxmag.m3u8?tag=12 | 126 |
| https://video.twimg.com/ext_tw_video/1108764347713703937/pu/vid/320x180/zRBXk6lGOg0MhciE.mp4?tag=8 | 110 |
| https://video.twimg.com/amplify_video/1092622594489139203/vid/320x180/7qZRbTf5BH1k7kjF.mp4?tag=9 | 103 |
| https://video.twimg.com/amplify_video/1100527279334219776/vid/320x180/nSst4z6uKgBVp9M2.mp4?tag=9 | 100 |
| Other values (77501) | |
| (Missing) |
| Value | Count | Frequency (%) |
| https://video.twimg.com/amplify_video/1098673630966358016/vid/640x360/s-QArpal8Y5g-1Co.mp4?tag=9 | 127 | < 0.1% |
| https://video.twimg.com/amplify_video/1126626046558625792/pl/JEInfD21OO9wxmag.m3u8?tag=12 | 126 | < 0.1% |
| https://video.twimg.com/ext_tw_video/1108764347713703937/pu/vid/320x180/zRBXk6lGOg0MhciE.mp4?tag=8 | 110 | < 0.1% |
| https://video.twimg.com/amplify_video/1092622594489139203/vid/320x180/7qZRbTf5BH1k7kjF.mp4?tag=9 | 103 | < 0.1% |
| https://video.twimg.com/amplify_video/1100527279334219776/vid/320x180/nSst4z6uKgBVp9M2.mp4?tag=9 | 100 | < 0.1% |
| https://video.twimg.com/ext_tw_video/1111845697232355329/pu/pl/M1dSwPCbKpjCJyOB.m3u8?tag=8 | 99 | < 0.1% |
| https://video.twimg.com/amplify_video/1104512132241113088/pl/8HfdRDRy6jgn4AxA.m3u8?tag=11 | 78 | < 0.1% |
| https://video.twimg.com/amplify_video/1104509561636048897/pl/Zsll6fXKKJ-m7kQD.m3u8?tag=11 | 78 | < 0.1% |
| https://video.twimg.com/amplify_video/1104508068052774912/vid/432x180/I5aWgI_WJVeb93w7.mp4?tag=11 | 77 | < 0.1% |
| https://video.twimg.com/ext_tw_video/1098670036225536000/pu/vid/360x640/8KDXU-GAsAa5_mRb.mp4?tag=6 | 76 | < 0.1% |
| Other values (77496) | 139517 | 17.8% |
| (Missing) | 645425 |
| Value | Count | Frequency (%) |
| https | 140491 | 17.9% |
| (Missing) | 645425 |
| Value | Count | Frequency (%) |
| video.twimg.com | 140491 | 17.9% |
| (Missing) | 645425 |
| Value | Count | Frequency (%) |
| /amplify_video/1098673630966358016/vid/640x360/s-QArpal8Y5g-1Co.mp4 | 127 | < 0.1% |
| /amplify_video/1126626046558625792/pl/JEInfD21OO9wxmag.m3u8 | 126 | < 0.1% |
| /ext_tw_video/1108764347713703937/pu/vid/320x180/zRBXk6lGOg0MhciE.mp4 | 110 | < 0.1% |
| /amplify_video/1092622594489139203/vid/320x180/7qZRbTf5BH1k7kjF.mp4 | 103 | < 0.1% |
| /amplify_video/1100527279334219776/vid/320x180/nSst4z6uKgBVp9M2.mp4 | 100 | < 0.1% |
| /ext_tw_video/1111845697232355329/pu/pl/M1dSwPCbKpjCJyOB.m3u8 | 99 | < 0.1% |
| /amplify_video/1104509561636048897/pl/Zsll6fXKKJ-m7kQD.m3u8 | 78 | < 0.1% |
| /amplify_video/1104512132241113088/pl/8HfdRDRy6jgn4AxA.m3u8 | 78 | < 0.1% |
| /amplify_video/1104508068052774912/vid/432x180/I5aWgI_WJVeb93w7.mp4 | 77 | < 0.1% |
| /ext_tw_video/1098670036225536000/pu/vid/360x640/8KDXU-GAsAa5_mRb.mp4 | 76 | < 0.1% |
| Other values (77496) | 139517 | 17.8% |
| (Missing) | 645425 |
| Value | Count | Frequency (%) |
| tag=9 | 26957 | 3.4% |
| tag=6 | 20364 | 2.6% |
| tag=8 | 19238 | 2.4% |
| tag=13 | 17994 | 2.3% |
| 15412 | 2.0% | |
| tag=11 | 15114 | 1.9% |
| tag=10 | 12847 | 1.6% |
| tag=12 | 6854 | 0.9% |
| tag=2 | 1994 | 0.3% |
| tag=5 | 1728 | 0.2% |
| Other values (4) | 1989 | 0.3% |
| (Missing) | 645425 |
| Value | Count | Frequency (%) |
| 140491 | 17.9% | |
| (Missing) | 645425 |
Interactions
Correlations
| edInput | editor | engages | isApproved | isRT | likes | retweets | rtUsID | topicName | tweetID | usFlwrs | usID | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| edInput | 1.000 | 0.578 | 0.003 | 0.964 | 0.166 | 0.003 | 0.003 | 0.069 | 0.258 | 0.026 | 0.083 | 0.069 |
| editor | 0.578 | 1.000 | 0.168 | 0.622 | 0.015 | 0.180 | 0.122 | -0.022 | 0.207 | 0.439 | -0.113 | 0.100 |
| engages | 0.003 | 0.168 | 1.000 | 0.003 | 0.029 | 0.994 | 0.956 | 0.148 | 0.022 | -0.002 | 0.136 | 0.152 |
| isApproved | 0.964 | 0.622 | 0.003 | 1.000 | 0.108 | 0.004 | 0.003 | 0.096 | 0.356 | 0.036 | 0.085 | 0.072 |
| isRT | 0.166 | 0.015 | 0.029 | 0.108 | 1.000 | 0.029 | 0.021 | 0.508 | 0.395 | 0.130 | 0.159 | 0.347 |
| likes | 0.003 | 0.180 | 0.994 | 0.004 | 0.029 | 1.000 | 0.921 | 0.150 | 0.023 | 0.007 | 0.112 | 0.165 |
| retweets | 0.003 | 0.122 | 0.956 | 0.003 | 0.021 | 0.921 | 1.000 | 0.129 | 0.011 | -0.031 | 0.199 | 0.099 |
| rtUsID | 0.069 | -0.022 | 0.148 | 0.096 | 0.508 | 0.150 | 0.129 | 1.000 | 0.376 | -0.101 | -0.511 | 0.388 |
| topicName | 0.258 | 0.207 | 0.022 | 0.356 | 0.395 | 0.023 | 0.011 | 0.376 | 1.000 | 0.045 | 0.268 | 0.303 |
| tweetID | 0.026 | 0.439 | -0.002 | 0.036 | 0.130 | 0.007 | -0.031 | -0.101 | 0.045 | 1.000 | 0.072 | -0.117 |
| usFlwrs | 0.083 | -0.113 | 0.136 | 0.085 | 0.159 | 0.112 | 0.199 | -0.511 | 0.268 | 0.072 | 1.000 | -0.721 |
| usID | 0.069 | 0.100 | 0.152 | 0.072 | 0.347 | 0.165 | 0.099 | 0.388 | 0.303 | -0.117 | -0.721 | 1.000 |
Missing values
Sample
| tweetID | crDate | edInput | editor | engages | isApproved | isEdNeed | isRT | likes | photoUrl | retweets | rtUsID | text | topicName | usFlwrs | usID | usName | videoUrl | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 1070867471245164544 | 2018-12-07 02:27:55 | -1 | -1 | 98 | False | True | False | 64 | https://pbs.twimg.com/media/Dtx8SiIWkAImVsb.jpg | 34 | -1 | The immediate impulse for an alliance of the EU's northern states is Brexit https://t.co/nlhUD36hay https://t.co/shwMWpjjuK | Business | 23464532 | 5988062 | The Economist | NaN |
| 1 | 1070868017888837633 | 2018-12-07 02:30:05 | -1 | -1 | 13 | False | True | False | 10 | https://pbs.twimg.com/media/Dtx8yTyW4AEciqP.jpg | 3 | -1 | America's economy is flashing some warning signs, but -- for now -- the labor market appears to be going strong https://t.co/xvCPgtqMzy https://t.co/0sQdzAsME3 | Business | 1732809 | 16184358 | CNN Business | NaN |
| 2 | 1070868012864028673 | 2018-12-07 02:30:04 | -1 | -1 | 12 | False | True | False | 8 | NaN | 4 | -1 | Lyft files for what is expected to be one of the hottest IPOs in 2019 https://t.co/qEjyniazlD | Business | 2253989 | 25053299 | FORTUNE | NaN |
| 3 | 1070867995239555075 | 2018-12-07 02:30:00 | -1 | -1 | 5 | False | True | False | 4 | NaN | 1 | -1 | Exporters still waiting to get Rs 6,000 crore worth of input tax credit refunds\n\nMany being denied tax refunds by state governments, such as Andhra Pradesh, Uttar Pradesh, Bihar and Chhattisgarh, who say they are cash starved\n\n@Subhayan_ism @GST_Council\n\nhttps://t.co/QRBg8b98Rr | Business | 1704056 | 43855487 | Business Standard | NaN |
| 4 | 1070867995205885952 | 2018-12-07 02:30:00 | -1 | -1 | 5 | False | True | False | 2 | NaN | 3 | -1 | Ride-hailing firm Lyft races to leave Uber behind in IPO chase https://t.co/0qCsdx2LYS https://t.co/gHZLUntYkL | Business | 1997662 | 15110357 | Reuters Business | https://video.twimg.com/amplify_video/1070811671948681226/vid/320x180/5hE60WR-z0Q537YU.mp4?tag=9 |
| 5 | 1070868019600076802 | 2018-12-07 02:30:06 | -1 | -1 | 1116 | False | True | False | 793 | NaN | 323 | -1 | Jaguar hugs! https://t.co/l1ICUSyjp7 | Animal | 68526 | 942754965528895488 | I_love_nature | https://video.twimg.com/ext_tw_video/1070363423303622656/pu/pl/i8Yo-hLiaVvItD_9.m3u8?tag=6 |
| 6 | 1070868102160769025 | 2018-12-07 02:30:25 | -1 | -1 | 31 | False | True | False | 17 | https://pbs.twimg.com/media/Dtx83JvX4AE48aw.jpg | 14 | -1 | -Asian stocks post modest gains \n-S&P 500 futures little changed\n-10-year Treasury yields stayed near 2.90%\n-Oil continues to be a drag on sentiment\n-Next up for embattled traders: the monthly U.S. payrolls report\nhttps://t.co/8cwVkXpoWQ https://t.co/EBp6jNaJP3 | Business | 5033221 | 34713362 | Bloomberg | NaN |
| 7 | 1070868071844376576 | 2018-12-07 02:30:18 | -1 | -1 | 9 | False | True | False | 7 | NaN | 2 | -1 | What's your pick? https://t.co/a0nnFRqIQ3 | Business | 2318088 | 2735591 | Fast Company | NaN |
| 8 | 1070868063359262720 | 2018-12-07 02:30:16 | -1 | -1 | 4 | False | True | False | 4 | NaN | 0 | -1 | Dick's CEO Ed Stack totals up how many employees quit over assault-style weapon decision https://t.co/kgOa4CQ5NR | Business | 48736 | 14921083 | Business Journals | NaN |
| 9 | 1070868025887387648 | 2018-12-07 02:30:07 | -1 | -1 | 90 | False | True | False | 63 | NaN | 27 | -1 | A meeting of tech leaders at the White House marked an easing of tensions between Washington and Silicon Valley https://t.co/blFfALEgIL | Business | 16198522 | 3108351 | The Wall Street Journal | NaN |
| tweetID | crDate | edInput | editor | engages | isApproved | isEdNeed | isRT | likes | photoUrl | retweets | rtUsID | text | topicName | usFlwrs | usID | usName | videoUrl | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 785906 | 1153093029617532928 | 2019-07-22 00:02:56 | -1 | -1 | 711 | False | True | True | 544 | NaN | 167 | 865670110593155072 | like and retweet this if you think it's a vibe!!! 🔥✈️✈️✈️ https://t.co/A9J0MCtvDI | Animal | 145 | 1100521587147591680 | icecreampat | https://video.twimg.com/ext_tw_video/1153090666596978688/pu/pl/Fxm1w0_pmSchNwsU.m3u8?tag=10 |
| 785907 | 1153986232407347201 | 2019-07-24 11:12:12 | -1 | -1 | 1560 | False | True | True | 893 | https://pbs.twimg.com/media/D_Rm_h7VAAEyEe5.jpg | 667 | 2271543909 | P O S I T I V I T Y https://t.co/oPo4j8bvh8 | Funny | 164127 | 768030487646396416 | My Feeling | NaN |
| 785908 | 1154078897220100097 | 2019-07-24 17:20:25 | -1 | -1 | 864 | False | True | True | 523 | https://pbs.twimg.com/media/D90uCeWUcAAapsN.jpg | 341 | 2271543909 | G O O D V I B E S O N L Y https://t.co/39nIzxdOz3 | Funny | 164127 | 768030487646396416 | My Feeling | NaN |
| 785909 | 1153738404636573696 | 2019-07-23 18:47:25 | -1 | -1 | 229 | False | True | True | 191 | NaN | 38 | 781427301472874497 | Skull Basher Axe 👀\n\nLike if you want this piece \n\nOn Sale: https://t.co/01QF855say https://t.co/byC3NKoPop | Interesting | 45615 | 778150181468512256 | Blade City | https://video.twimg.com/ext_tw_video/1153738015581331462/pu/vid/320x320/p2UV3CxapFMGWmdb.mp4?tag=10 |
| 785910 | 1135277475817304065 | 2019-06-02 20:10:17 | -1 | -1 | 1464 | False | True | True | 1303 | NaN | 161 | 2355808260 | The best part of David Lynch's "Eraserhead Stories" is when he talks about buying a supermarket Dutch apple pie for the same price as a single slice at the Hamburger Hamlet and then subsequently sneaking his own slices into the Hamburger Hamlet, describing it as "a real thrill." | Photography | 39896 | 421542096 | Ari Aster | NaN |
| 785911 | 1147325851614117888 | 2019-07-06 02:06:13 | -1 | -1 | 3 | False | True | True | 1 | NaN | 2 | 542154137 | Relations are DIFFERENT\nnot DIFFICULT. | Motivational | 85625 | 542154137 | Wit & Wisdom 💯 | NaN |
| 785912 | 1153184058714624001 | 2019-07-22 06:04:39 | -1 | -1 | 867 | False | True | True | 561 | https://pbs.twimg.com/media/EADuxohU8AAQo8G.jpg | 306 | 858516111410647040 | "to live a creative life, we must lose our fear of being wrong"......... https://t.co/LF0e0xV5Q7 | Interesting | 208417 | 2920686840 | DeepFeling™ | NaN |
| 785913 | 1153048802116292608 | 2019-07-21 21:07:11 | -1 | -1 | 4605 | False | True | True | 4253 | NaN | 352 | 3282859598 | Who's your comic crush? https://t.co/H29dhXw3kf | Memes | 7024207 | 436266454 | Twitter Movies | https://video.twimg.com/amplify_video/1153047495326355457/vid/1280x720/P996nFjt3ncUO467.mp4?tag=13 |
| 785914 | 1154063052997836801 | 2019-07-24 16:17:27 | -1 | -1 | 5638 | True | True | False | 4996 | https://pbs.twimg.com/media/EAQOObJWwAASaxj.jpg | 642 | -1 | After a flight of 195 hours, 18 minutes, 35 seconds - the #Apollo11 crew splashed down in the North Pacific Ocean, 900 miles southwest of Hawaii! Here’s a photo of their recovery as we celebrate the #Apollo50th anniversary: https://t.co/Y4zhGTQlPj https://t.co/fBpvcECsjp | Random | 32030797 | 11348282 | NASA | NaN |
| 785915 | 1073723027718688768 | 2018-12-14 23:34:53 | -1 | -1 | 4181 | False | True | True | 3282 | https://pbs.twimg.com/media/DuahZZeUYAA7-55.jpg | 899 | 2355808260 | Scarface's Action Figure Tony Montana cutting open a pack of Flour on a kitchen table\n(by artist VSE OK) https://t.co/vOqvOh7EFn | Photography | 606924 | 2355808260 | 41 Strange | NaN |